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REAGENTS AND METHODS FOR MODULATING 
DKK-MEDIATED INTERACTIONS 

FIELD OF THE INVENTION 

The present Invention relates to signal transduction, bone development, bone 
loss disorders, modulation of lipid-related conditions, research reagents, methods of 
screening drug leads, drug development, treatments for bone and/or lipid disorders, 
screening and development of therapies, molecular, cellular, and animal models of 

» 

bone and/or lipid development and maintenance, which are mediated by Dkk, 
including Dkk-1, and/or LRP5, LRP6, HBM or other members of the Wnt pathway. 

BACKGROUND OF THE INVENTION 

Two of the most common types of osteoporosis are postmenopausal and 
senile osteoporosis. Osteoporosis affects both men and women, and, taken with 
other abnormalities of bone, presents an ever-increasing health risk for an aging 
population. The most common type of osteoporosis is that associated with 
menopause. Most women lose between 20-60% of the bone mass in the trabecular 
compartment of the bone within 3-6 years after the cessation of menses. This rapid 
bone loss is generally associated with an increase of bone resorption and formation. 
However, the resorptive cycle is more dominant and the result is a net loss of bone 
mass. Osteoporosis is a common and serious disease among postmenopausal 
women. There are an estimated 25 million women in the United States alone who 
are afflicted with this disease. The results of osteoporosis are personally harmful, 
and also account for a large economic loss due to its chronicity and the need for 
extensive and long-term support (e.g., hospitalization and nursing home care) from 
disease sequelae. This is especially true in elderly patients. Additionally, while 
osteoporosis is generally not thought of as a life-threatening condition, a 20-30% 
mortality rate is related to hip fractures in elderly women. A large percentage of this 
mortality rate can be directly associated with postmenopausal osteoporosis. 
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The most vulnerable tissue in the bone to the effects of postmenopausal 
osteoporosis is the trabecular bone. This tissue is often referred to as spongy bone 
and is particularly concentrated near the ends of the bone, near the joints, and in the 
vertebrae of the spine. The trabecular tissue is characterized by small structures 
5 which inter-connect with each other as well as the more solid and dense cortical 
tissue which makes up the outer surface and central shaft of the bone. This cris- 
cross network of trabeculae gives lateral support to the outer cortical structure and is 
critical to the biomechanical strength of the overall structure. In postmenopausal 
osteoporosis, it is primarily the net resorption and loss of the trabeculae which lead 

10 to the failure and fracture of the bone. In light of the loss of the trabeculae in 

postmenopausal women, it is not surprising that the most common fractures are 
those associated with bones which are highly dependent on trabecular support, e.g., 
the vertebrae, the neck of the femur, and the forearm. Indeed, hip fracture, Colle's 
fractures, and vertebral crush fractures are indicative of postmenopausal 

15 osteoporosis. Osteoporosis affects cortical as well as trabecular bone. Alterations in 
endosteal bone resorption and Haversian remodeling with age affect cortical 
thickness and structural integrity contributing the increased risk for fracture. 

One of the earliest generally accepted methods for treatment of 
postmenopausal osteoporosis was estrogen replacement therapy. Although this 

20 therapy frequently is successful, patient compliance is low, primarily due to the 

undesirable side-effects of chronic estrogen treatment. Frequently cited side-effects 
of estrogen replacement therapy include reinitiation of menses, bloating, depression, 
and, potentially, increased risk of breast or uterine cancer. In order to limit the 
known threat of uterine cancer in women who have not had a hysterectomy, a 

25 protocol of estrogen and progestin cyclic therapy is often employed. This protocol is 
similar to that used in birth control regimens, and often is not tolerated by women 
because of the side-effects characteristic of progestin. More recently, certain 
antiestrogens, originally developed for the treatment of breast cancer, have been 
shown in experimental models of postmenopausal osteoporosis to be efficacious. 

30 Among these agents is raloxifene (See, U.S. Patent No. 5,393,763; Black et a/., J. 
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Clin. Invest, 93:63-69 (1994); and Ettinger etal., JAMA 282:637-45 (1999)). In 
addition, tamoxifen, a widely used clinical agent for treating breast cancer, has been 
shown to increase bone mineral density in post menopausal women suffering from 
breast cancer (Love etal., N. Engl. J. Med., 326:852-856 (1992)). 
5 Another therapy for the treatment of postmenopausal osteoporosis is the use 

of calcitonin. Calcitonin is a naturally occurring peptide which inhibits bone 
resorption and has been approved for this use in many countries (Overgaard et a/., 
Br. Med. J., 305:556-561 (1992)). The use of calcitonin has been somewhat limited, 
however. Its effects are very modest in increasing bone mineral density, and the 

1 0 treatment is very expensive. Another therapy for the treatment of postmenopausal 
osteoporosis is the use of bisphosphonates. These compounds were originally 
developed for treating Pagefs disease and malignant hypercalcemia. They have 
been shown to inhibit bone resorption. Alendronate, a bisphosphonate, has been 
approved for the treatment of postmenopausal osteoporosis. These agents may be 

15 helpful in the treatment of osteoporosis, but these agents also have potential 

liabilities which include osteomalacia, extremely long half-life in bone (greater than 2 
years), and possible "frozen bone syndrome," e.g., the cessation of normal bone 
remodeling. 

Senile osteoporosis is similar to postmenopausal osteoporosis in that it is 
20 marked by the loss of bone mineral density and resulting increase in fracture rate, 
morbidity, and associated mortality. Generally, it occurs in later life, i.e., after 70 
years of age. Historically, senile osteoporosis has been more common in females, 
but with the advent of a more elderly male population, this disease is becoming a 
major factor in the health of both sexes. It is not clear what, if any, role hormones 
25 such as testosterone or estrogen have in this disease, and its etiology remains 
obscure. Treatment of this disease has not been very satisfactory. Hormone 
therapy, estrogen in women and testosterone in men, has shown equivocal results; 
calcitonin and bisphosphonates may be of some utility. 

The peak mass of the skeleton at maturity is largely under genetic control. 
30 Twin studies have shown that the variance in bone mass between adult monozygotic 
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twins is smaller than between dizygotic twins (Slemenda et al., J. Bone Miner. Res., 
6: 561-567 (1991); Young et a/., J. Bone Miner. Res., 6:561-567 (1995); Pocock et 
al., J. Clin. Invest, 80:706-710 (1987); Kelly era/., J. Bone Miner. Res., 8:11-17 
(1993)). It has been estimated that up to 60% or more of the variance in skeletal 
5 mass is inherited (Krall et al., J. Bone Miner. Res., 10:S367 (1993)). Peak skeletal 
mass is the most powerful determinant of bone mass in elderly years (Hui et al., 
Ann. Int. Med., 111:355-361 (1989)), even though the rate of age-related bone loss 
in adult and later life is also a strong determinant (Hui et al., Osteoporosis Int., 1 :30- 
34 (1995)). Since bone mass is the principal measurable determinant of fracture 
1 0 risk, the inherited peak skeletal mass achieved at maturity is an important 

determinant of an individual's risk of fracture later in life. Thus, study of the genetic 
basis of bone mass is of considerable interest in the etiology of fractures due to 
osteoporosis. 

Recently, a strong interest in the genetic control of peak bone mass has 

15 developed in the field of osteoporosis. The interest has focused mainly on candidate 
genes with suitable polymorphisms to test for association with variation in bone mass 
within the normal range, or has focused on examination of genes and gene loci 
associated with low bone mass in the range found in patients with osteoporosis. The 
vitamin D receptor locus (VDR) (Morrison etal., Nature, 367:284-287 (1994)), PTH 

20 gene (Howard et al., J. Clin. Endocrinol. Metab., 80:2800-2805 (1995); Johnson et 
al., J. Bone Miner. Res., 8:1 1-17 (1995); Gong et al., J. Bone Miner. Res., 10:S462 
(1995)) and the estrogen receptor gene (Hosoi et al., J. Bone Miner. Res., 10:S170 
(1995); Morrison etal., Nature, 367:284-287 (1994)) have figured most prominently 
in this work. These studies are difficult because bone mass (i.e. the phenotype) is a 

25 continuous, quantitative, polygenic trait, and is confounded by environmental factors 
such as nutrition, co-morbid disease, age, physical activity, and other factors. Also, 
this type of study design requires large numbers of subjects. In particular, the 
results of VDR studies to date have been confusing and contradictory (Garnero ef 
al., J. Bone Miner. Res., 10:1283-1288 (1995); Eisman etal., J. Bone. Miner. Res., 

30 10:1289-1293 (1995); Peacock, J. Bone Miner. Res., 10:1294-1297 (1995)). 
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Furthermore, thus far, the art has not determined the mechanism(s) whereby the 
genetic influences exert their effect on bone mass. 

While it is well known that peak bone mass is largely determined by genetic 
rather than environmental factors, studies to determine the gene loci (and ultimately 
5 the genes) linked to variation in bone mass are difficult and expensive. Study 

designs which utilize the power of linkage analysis, e.g., sib-pair or extended family, 
are generally more informative than simple association studies, although the latter do 
have value. However, genetic linkage studies involving bone mass are hampered by 
two major problems. The first problem is the phenotype, as discussed briefly above. 

10 Bone mass is a continuous, quantitative trait, and establishing a discrete phenotype 
is difficult. Each anatomical site for measurement may be influenced by several 
genes, many of which may be different from site to site. The second problem is the 
age component of the phenotype. By the time an individual can be identified as 
having low bone mass, there is a high probability that their parents or other members 

15 of prior generations will be deceased and therefore unavailable for study, and 

younger generations may not have even reached peak bone mass, making their 
phenotyping uncertain for genetic analysis. 

Thus, there is a need in the art for additional research tools for the elucidation 
of the molecular mechanism of bone modulation, for the screening and development 

20 of candidate drugs, and for treatments of bone development and bone loss 

disorders. The present invention is directed to these, as well as other, important 
ends. 

In addition to bone modulation, the present invention relates to modulation of 
lipid levels. Cardiovascular disease is the most common cause of mortality in the 

25 United States, and atherosclerosis is the major cause of heart disease and stroke. It 
is widely appreciated that cholesterol plays an important role in atherogenesis. 
Normally, most cholesterol serves as a structural element in the walls of cells, 
whereas much of the rest is in transit through the blood or functions as the starting 
material for the synthesis of bile acids in the liver, steroid hormones in endocrine 

30 cells and vitamin D in skin. The transport of cholesterol and other lipids through the 



5 



WO 02/092015 



PCTYUS02/15982 



circulatory system is facilitated by their packaging into lipoprotein carriers. These 
spherical particles comprise protein and phospholipid shells surrounding a core of 
neutral lipid, including unesterified ("free") or esterified cholesterol and triglycerides. 
Risk for atherosclerosis increases with increasing concentrations of low density 
5 lipoprotein (LDL) cholesterol, whereas risk is inversely proportional to levels of high- 
density lipoprotein (HDL) cholesterol. The receptor-mediated control of plasma LDL 
levels has been well-defined, and recent studies have now provided new insights 
into HDL metabolism. 

The elucidation of LDL metabolism began in 1974 by Michael Brown and 
10 Joseph Goldstein. In brief, the liver synthesizes a precursor lipoprotein (very low 

density lipoprotein, VLDL) that is converted during circulation to intermediate density 
lipoprotein (IDL) and then to LDL. The majority of the LDL receptors expressed in 
the body are on the surfaces of liver cells, although virtually all other tissues 

* » 

("peripheral tissues") express some LDL receptors. After binding, the receptor- 

15 lipoprotein complex is internalized by the cells via coated pits and vesicles, and the 
entire LDL particle is delivered to lysosomes, wherein it is dissembled by enzymatic 
hydrolysis, releasing cholesterol for subsequent cellular metabolism. This whole- 
particle uptake pathway is called "receptor-mediated endocytosis." Cholesterol- 
mediated feedback regulation of both the levels of LDL receptors and cellular 

20 cholesterol biosynthesis help ensure cellular cholesterol homeostasis. Genetic 
defects in the LDL receptor in humans results in familial hypercholesterolemia, a 
disease characterized by elevated plasma LDL cholesterol and premature 
atherosclerosis and heart attacks. One hypothesis for the deleterious effects of 
excess plasma LDL cholesterol is that LDL enters the artery wall, is chemically 

25 modified, and then is recognized by a special class of receptors called macrophage 
scavenger receptors, that mediate the cellular accumulation of the LDL cholesterol in 
the artery, eventually leading to the formation of an atherosclerotic lesion. 

The major lipoprotein classes include intestinally derived chylomicrons that 
transport dietary fats and cholesterol, hepatic-derived VLDL, IDL, and LDL that can 

30 be atherogenic, and hepatic- and intestinally-derived HDL that are antiatherogenic. 
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Apoprotein B (ApoB) is necessary for the secretion of chylomicrons (ApoB48) and 
VLDL, IDL, and LDL (ApoB100). Plasma levels of VLDL triglycerides are determined 
mainly by the rates of secretion in LDL lipolytic activity. Plasma levels of LDL 
cholesterol are determined mainly by the secretion of ApoB100 into plasma, the 
5 efficacy with which VLDL are converted to LDL and by LDL receptor-mediated 

clearance. Regulation of HDL cholesterol levels is complex and is affected by rates 
of synthesis of its Apo proteins, rates of esterification of free cholesterol to 
cholesterol ester by LCAT, levels of triglyceride-rich lipoproteins and CETP-mediated 
transfer of cholesterol esters from HDL, and clearance from plasma of HDL lipids 

10 and Apo proteins. 

Normal lipoprotein transport is associated with low levels of triglycerides and 
LDL cholesterol and high levels of HDL cholesterol. When lipoprotein transport is 
abnormal, lipoprotein levels can change in ways that predispose individuals to 
atherosclerosis and arteriosclerosis (see Ginsburg, Endocrinol. Metab. Clin. North 

15 Am., 27:503-19 (1998)). 

Several lipoprotein receptors may be involved in cellular lipid uptake. These 
receptors include: scavenger receptors; LDL receptor-related protein/a2- 
macroglobulin receptor (LRP); LDL receptor; and VLDL receptor. With the exception 
of the LDL receptor, all of these receptors are expressed in atherosclerotic lesions 

20 while scavenger receptors are mostly expressed in macrophages, the LRP and 
VLDL receptors may play an important role in mediating lipid uptake in smooth 
muscle cells (Hiltunen etal., Atherosclerosis, 137 suppl.:S81-8 (1998)). 

A major breakthrough in the pharmacologic treatment of hypercholesterolemia 
has been the development of the "statin" class of 3-hydroxy-3-methylglutaryl-CoA 

25 reductase (HMG CoA reductase) inhibitory drugs. 3-hydroxy-3-methylglutaryl-CoA 
reductase is the rate controlling enzyme in cholesterol biosynthesis, and its inhibition 
in the liver stimulates LDL receptor expression. As a consequence, both plasma 
LDL cholesterol levels and the risk for atherosclerosis decrease. The discovery and 
analysis of the LDL receptor system has had a profound impact on cell biology, 

30 physiology, and medicine. 
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HDL is thought to remove unesterified, or "free" cholesterol (FC) from 
peripheral tissues, after which most of the cholesterol is converted to cholesterol 
ester (CE) by enzymes in the plasma. Subsequently, HDL cholesterol is efficiently 
delivered directly to the liver and steroidogenic tissues via a selective uptake 
pathway and the HDL receptor, SR-BI (class B type I scavenger receptor) or, in 
some species, transferred to other lipoproteins for additional transport in metabolism 
(see Krieger, Proc. Natl. Acad. Sci. USA, 95:4077-4080 (1998)). 

These issues illustrate a need in the art for additional research tools for the 
elucidation of the molecular mechanism of lipid modulation, for the screening and 
development of candidate drugs, and for treatments of lipid levels and lipid level 
modulation disorders. The present invention is directed to these, as well as other, 
important ends. 

SUMMARY OF THE INVENTION 
The present invention provides reagents, compounds, compositions and 
methods relating to novel interactions of the extracellular domain of LRP5, HBM (a 
variant of LRP5), and/or LRP6 with Dkk proteins. LRP5 is also referred to as Zmaxl 
or Zmax. Thus, when discussing methods, reagents, compounds, and compositions 
of the invention which relate to the interaction between Dkk and LRP5 (or Zmaxl ), 
the invention is also to be understood to encompass embodiments relating to 
interactions between Dkk and LRP6 and Dkk and HBM. Moreover, where Dkk is 
discussed herein, it is to be understood that the methods, reagents, compounds, and 
compositions of the present invention include the Dkk family members, including but 
not limited to Dkk-1 , Dkk-2, Dkk-3, Dkk-4 and Soggy. Furthermore, the invention 
encompasses novel fragments of Dkk-1 which demonstrate a binding interaction 
between the ligand binding domain (LBD) of LRP5 and additional proteins and/or 
which can modulate an interaction between LRP5, or a variant or fragment thereof, 
and a Dkk protein. The invention provides assays, methods, compositions, and 
compounds relating to Dkk-Wnt signaling. Numerous Wnt proteins are compatible 
with the present invention, including Wnt1-Wnt19, and particularly, Wnt1, Wnt3, 



8 



WO 02/092015 



PO7US02/15982 



Wnt3a, and WntlOb. The present invention further provides reagents, compounds, 
compositions and methods modulating interactions between one or more other 
proteins and Dkk-1 . The present invention also provides a series of peptide 
aptamers which bind to Dkk-1 or to LRP5 (or HBM and/or LRP6). 
5 The polypeptides of the invention, for example in the form of peptide 

oligomers, aptamers, proteins, and protein fragments as well as the nucleic acids of 
the invention, for example in the form of nucleic acids which encode the polypeptides 
of the invention as well as antisense, or complimentary nucleic acids, are useful as 
reagents for the study of bone mass and lipid level modulation. The polypeptides 
10 and nucleic acids of the invention are also useful as therapeutic and diagnostic 
agents. 

The present invention provides useful reagents for the modulation of Dkk 
proteins with LRP5, LRP6, and/or HBM, the modulation Dkk-1 and/or Dkk-1 
interacting protein activity, and modulation of LRP5/Dkk-1 , LRP6/Dkk1 and 

15 HBM/Dkk-1 interactions and Dkk-1 /Dkk-1 interacting protein interactions. The 

present invention provides a series of peptide aptamers which bind Dkk-1 or LRP5, 
LRP6, and/or HBM. 

An object of the invention is to provide for a method of regulating 
LRP5/LRP6/HBM/HBM-like activity in a subject comprising administering a 

20 therapeutically effective amount of a composition which modulates Dkk activity. The 
subject can be a vertebrate or an invertebrate organism, but more preferably the 
organism is a canine, a feline, an ovine, a primate, an equine, a porcine, a caprine, 
a camelid, an avian, a bovine, or a rodent organism. A more preferred organism is a 
human. In a preferred embodiment, the Dkk protein is Dkk-1. In a particularly 

25 preferred embodiment, Dkk-1 activity is decreased. In another embodiment, Dkk 
activity modulates bone mass and/or lipid levels. In a preferred embodiment, bone 
mass is increased and/or lipid levels are decreased. In another preferred 
embodiment, the modulation in bone mass is an increase in bone strength 
determined via one or more of a decrease in fracture rate, an increase in areal bone 

30 density, an increase in volumetric mineral bone density, an increase in trabecular 
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connectivity, an increase in trabecular density, an increase in cortical density or 
thickness, an increase in bone diameter, and an increase in inorganic bone content. 
The invention further provides such a method wherein the composition comprises a 
Dkk, Dkk-1 or a LRP5/LRP6/HBM binding fragment thereof, such as those depicted 
5 in Figure 6 or a mimetic of those fragments depicted in Figure 6. The invention 

further provides such a method wherein the composition comprises one or more of 
the proteins which interact with Dkk, including Dkk-1 , such as those depicted in 
Figure 5, or a Dkk-binding fragment thereof, or an antisense, siRNA, or shRNA 
molecule which recognizes and binds to a nucleic acid encoding one or more Dkk 

10 interacting or Dkk-1 interacting proteins. The invention further provides such a 

method wherein the composition comprises an LRP5/LRP6/Zmax1 antibody, Dkk 
antibody, a Dkk-1 antibody or an antibody to a Dkk-1 interacting protein. The 
invention further provides such a method wherein the compositions comprise an 
aptamer of Dkk or Dkk-1, such as those depicted in Figure 3 (SEQ ID NOs:171-188), 

15 or a mimetic of such an aptamer. The method further provides that invention further 
provides such a method wherein the compositions comprise an aptamer of a Dkk 
interacting or Dkk-1 interacting protein, or a mimetic of such an aptamer. 

A composition of the present invention may modulate activity either by 
enhancing or inhibiting the binding of Dkk to LRP5/LRP6/Zmax1 , particularly Dkk-1 , 

20 or the binding of Dkk-1 to a Dkk-1 interacting protein, such as those shown in Figure 
5. A composition of the present invention may comprise an LRP5 peptide aptamer, 
such as OST262 (SEQ ID NO:208), Figures 4 (SEQ ID NOs:189-192) (particularly, 
peptide (SEQ ID NO:191) and 13 (including SEQ IDNOs:204-214), or a mimetic of 
such an aptamer. Preferred compositions of the present invention also comprise 

25 LRP5 antibodies. 

Another aspect of the invention is to provide for a method of regulating Dkk- 
Wnt pathway activity in a subject comprising administering a therapeutically effective 
amount of a composition which modulates Dkk-Wnt pathway activity. In a preferred 
embodiment, the Dkk protein is Dkk-1 . In a particularly preferred embodiment, Dkk- 

30 1 activity is decreased. In another embodiment, Dkk activity modulates bone mass 
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and/or lipid levels. In a preferred embodiment, bone mass is increased and/or lipid 
levels are decreased. In another preferred embodiment, the modulation in bone 
mass is an increase in bone strength determined via one or more of a decrease in 
fracture rate, an increase in areal bone density, an increase in volumetric mineral 
bone density, an increase in trabecular connectivity, an increase in trabecular 
density, an increase in cortical density or thickness, an increase in bone diameter, 
and an increase in inorganic bone content. In another preferred embodiment, the 
Wnt is Wnt1-Wnt19. In a particularly preferred embodiment, the Wnt is Wnt1, Wn3, 
Wnt3a, or Wnt10b. Preferred compositions comprise Dkk-modulating or Dkk-1 - 
modulating compounds or one or more Dkk interacting or Dkk-1 interacting proteins, 
or a Dkk-binding fragment thereof. Other preferred Dkk modulating compositions 
comprise a Dkk or Dkk-1 antibody or an antibody to a Dkk interacting or Dkk-1 
interacting protein. Also contemplated are antisense, siRNA, and shRNA molecules 
which recognize and bind to a nucleic acid encoding one or more Dkk-1 interacting 
proteins. The invention further provides such a method wherein the composition 
comprises a biologically active or LRP5/LRP6/HBM binding fragment of Dkk, 
including Dkk-1 , such as those depicted in Figure 6 or a mimetic of those fragments 
depicted in Figure 6. The Dkk modulating composition may also comprise a peptide 
aptamer of a Dkk interacting or Dkk-1 interacting protein, or a mimetic of such an 
aptamer. A composition of the present invention may modulate activity either by 
enhancing or inhibiting the binding of Dkk, including Dkk-1, to LRP5, LRP6, or HBM 
or the binding of Dkk, including Dkk-1, to a Dkk interacting protein, such as those 
shown in Figure 5. The invention further provides such a method wherein the 
composition comprises an aptamer of Dkk or Dkk-1 , such as those depicted. A 
composition of the present invention may comprise an LRP5 peptide aptamer, such 
as OST262 (SEQ ID NO:208). Preferred compositions of the present invention also 
comprise LRP5 antibodies. 

A further aspect of the invention is to provide for a method of modulating Wnt 
signaling in a subject comprising administering a therapeutically effective amount of 
a composition which modulates Dkk activity or modulates Dkk interaction with LRP5 
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(or LRP6 or HBM). In a preferred embodiment, the Dkk protein is Dkk-1. In a 
particularly preferred embodiment, Dkk-1 activity is decreased. In another 
embodiment, Dkk activity modulates bone mass and/or lipid levels. In a preferred 
embodiment, bone mass is increased and/or lipid levels are decreased. In another 
preferred embodiment, the modulation in bone mass is an increase in bone strength 
determined via one or more of a decrease in fracture rate, an increase in areal bone 
density, an increase in volumetric mineral bone density, an increase in trabecular 
connectivity, an increase in trabecular density, an increase in cortical density or 
thickness, an increase in bone diameter, and an increase in inorganic bone content. 

■ 

In another preferred embodiment, the Wnt is Wnt1-Wnt19. In a particularly preferred 
embodiment, the Wnt is Wnt1 , Wnt3, Wnt3a, or Wnt10b. Preferred Wnt modulating 
compositions comprise one or more Dkk interacting or Dkk-1 interacting proteins, or 
a biologically active or LRP5/LRP6/HBM binding fragment thereof. Also 
contemplated are antisense, siRNA, and shRNA molecules which recognize and 
bind to a nucleic acid encoding one or more Dkk interacting or Dkk-1 interacting 
proteins. The invention further provides such a method wherein the composition 
comprises a biologically active or LRP5/LRP6/HBM binding fragment of Dkk or Dkk- 
1, such as those depicted in Figure 6 or a mimetic of those fragments depicted in 
Figure 6. The Dkk modulating composition may also comprise a peptide aptamer of 
a Dkk interacting or Dkk-1 interacting protein, or a mimetic of such an aptamer. A 
composition of the present invention may modulate activity either by enhancing or 
blocking the binding of Dkk, including Dkk-1 , to LRP5, LRP6, or HBM or the binding 
of Dkk or Dkk-1 to a Dkk interacting or Dkk-1 interacting protein, such as those 
shown in Figure 5. The invention further provides such a method wherein 
compositions comprising an aptamer of Dkk or Dkk-1 , such as those depicted in 
Figure 3 (SEQ ID NOs:171-188), or a mimetic of such an aptamer. The invention 
further provides such a method wherein the composition comprises a Dkk or Dkk-1 
antibody or an antibody to a Dkk interacting or Dkk-1 interacting protein. The 
invention further provides such a method wherein compositions of an LRP5 peptide 
aptamer, such as OST262 (SEQ ID NO:208), Figures 4 (SEQ ID NO:189-192 
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(particularly peptide (SEQ ID N0:191) and Figure 13 (including SEQ ID NOs:204- 
214), or a mimetic of such an aptamer. Additional preferred compositions of the 
present invention also comprise LRP5 antibodies. 

Additionally, the invention provides for a method of modulating bone mass 
and/or lipid levels in a subject comprising administering to the subject a composition 
which modulates Dkk activity or Dkk interaction with LRP5 in an amount effective to 
modulate bone mass and/or lipid levels, wherein bone mass and/or lipid levels are in 
need of modulation. In a preferred embodiment, the Dkk protein is Dkk-1. In a 
particularly preferred embodiment, Dkk-1 activity is decreased. In another 
embodiment, Dkk activity modulates bone mass and/or lipid levels. In a preferred 
embodiment, bone mass is increased and/or lipid levels are decreased. In another 
preferred embodiment, the modulation in bone mass is an increase in bone strength 
determined via one or more of a decrease in fracture rate, an increase in areal bone 
density, an increase in volumetric mineral bone density, an increase in trabecular 
connectivity, an increase in trabecular density, an increase in cortical density or 
thickness, an increase in bone diameter, and an increase in inorganic bone content. 
Preferred bone mass and/or lipid modulating compositions comprise one or more 
Dkk interacting or Dkk-1 interacting proteins, or a biologically active or 
LRP5/LRP6/HBM binding fragment thereof. Also contemplated are antisense, 
siRNA, and shRNA molecules which recognize and bind to a nucleic acid encoding 
one or more Dkk interacting or Dkk-1 interacting proteins. The invention further 
provides such a method wherein the composition comprises a biologically active or 
LRP5/LRP6/HBM binding fragment of Dkk, including Dkk-1, such as those depicted 
in Figure 6 or a mimetic of those fragments depicted in Figure 6. The Dkk 
modulating composition may also comprise a peptide aptamer of a Dkk interacting or 
Dkk-1 interacting protein, or a mimetic of such an aptamer. The invention further 
provides such a method wherein the composition comprises an aptamer of Dkk or 
Dkk-1, such as those depicted in Figure 3 (SEQ ID NOs:171-188), or a mimetic of 
such an aptamer. A composition of the present invention may modulate activity 
either by enhancing or inhibiting the binding of Dkk, including Dkk-1, to LRP5, LRP6, 
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or HBM or the binding of Dkk, including Dkk-1, to a Dkk interacting protein, such as 
those shown in Figure 5. The invention further provides such a method wherein the 
composition comprises a Dkk or Dkk-1 antibody or an antibody to a Dkk interacting 
or Dkk-1 interacting protein. A composition of the present invention may comprise 
an LRP5 peptide aptamer, such as OST262 (SEQ ID NO:208), Figures 4 (SEQ ID 
NOs:189-192 (particularly peptide 13 (SEQ ID NO:191)) and 13 (including SEQ ID 
NOs:204-214), or a mimetic of such an aptamer. Preferred compositions of the 
present invention also comprise LRP5 antibodies. It is a further aspect of the 
invention that such lipid-modulated diseases include a cardiac condition, 
atherosclerosis, familial lipoprotein lipase deficiency, familial apoprotein CI I 
deficiency, familial type 3 hyperlipoproteinemia, familial hypercholesterolemia, 
familial hypertriglyceridemia, multiple lipoprotein-type hyperlipidemia, elevated lipid 
levels due to dialysis and/or diabetes, and an elevated lipid level of unknown 
etiology. 

Bone disorders contemplated for treatment and/or diagnosis by the methods 
and compositions disclosed herein include a bone development disorder, a bone 
fracture, age related loss of bone, a chondrodystrophy, a drug-induced bone 
disorder, high bone turnover, hypercalcemia, hyperostosis, osteogenesis imperfecta, 
osteomalacia, osteomyelitis, osteoporosis, Paget's disease, osteoarthritis, and 
rickets. 

It is a further object of the invention to provide a method of screening for 
compounds or compositions which modulates the interaction of Dkk with LRP5, 
LRP6, HBM, or a Dkk-binding fragment of LRP5, LRP6, or HBM comprising: 

(a) exposing Dkk or a LRP5/LRP6/HBM binding 
fragment thereof to a compound; and 

(b) determining whether said compound binds to Dkk or the 
LRP5/LRP6/HBM binding fragment thereof. 

In a preferred embodiment, the Dkk is Dkk-1 . In a particularly preferred 
embodiment, the binding of Dkk-1 to LRP5/LRP6/HBM is decreased. 
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It is a further object of the invention to provide a method of screening 
compounds or compositions which modulate the interaction of DKK with LRP5, 
LRP6, HBM, or a DKK-finding fragment thereof comprising: 

(a) exposing DKK or a LRP5/LRP6/HBM binding 
fragment thereof to a compound; and, 

(b) determining whether said compound modulates 
the interaction of Dkk with LRP5, LRP6, or HBM, 
or the Dkk-binding fragment of LRP5/LRP6/HBM. 

In a preferred embodiment, the Dkk is Dkk-1. In a particularly preferred 
embodiment, the interaction of Dkk-1 with LRP5/LRP6/HBM is decreased. 

It is a further object of the invention to provide a method of screening for 
compounds or compositions which modulates the interaction of Dkk with LRP5, 
LRP6, HBM, or a Dkk-binding fragment of LRP5, LRP6, or HBM comprising: 

(a) exposing Dkk or a LRP5/LRP6/HBM binding fragment 
thereof to a compound; 

(b) determining whether said compound binds to Dkk or the 
LRP5/LRP6/HBM binding fragment thereof; and, 

(c) further determining whether said compound modulates 
the interaction of Dkk with LRP5, LRP6, or HBM, or the 
Dkk-binding fragment of LRP5/LRP6/HBM. 

In preferred embodiments of such methods, Dkk or a biologically active 
fragment thereof is attached to a solid substrate. In an alternative embodiment of 
the invention, LRP5/LRP6/HBM, or a biologically active fragment thereof (such as 
the ligand binding domain), is exposed to the compound. Another aspect of the 
invention provides for compounds and compositions identified by the disclosed 
methods. A preferred embodiment of the invention provides that the compound 
screened in an afore-mentioned method is one or more proteins which interact with 
Dkk, particularly Dkk-1, as depicted in Figure 5, or a LRP5/LRP6/HBM-binding 
fragment thereof. Another preferred embodiment provides that the compound 
comprises a Dkk or Dkk-1 peptide aptamer, such as those depicted in Figure 3 (SEQ 
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ID NOs:171-188), or a mimetic of such aptamers. The compound may also 
comprise a peptide aptamer of a Dkk interacting or Dkk-1 interacting protein, or a 
mimetic of such an aptamer. The method further provides that the compound 
comprises a Dkk or Dkk-1 antibody or an antibody to a Dkk-1 interacting protein. 
5 The invention further provides that the compound may comprise an LRP5 peptide 
aptamer, such as OST262 (SEQ ID NO:208), Figure 4 (SEQ ID NOs: 189-1 92) 
(particularly peptide 13 (SEQ ID NO:191)) and Figure 13 (including SEQ ID 
NOs:204-214), or a mimetic of such an aptamer. Preferred compounds of the 
present invention also comprise LRP5 antibodies. 
10 It is a further object of the invention to provide a method of screening for 

compounds or compositions which modulate the interaction of Dkk and a Dkk 
interacting protein comprising: 

(a) exposing a Dkk interacting proteins or a Dkk- 
binding fragment thereof to a compound; and, 
15 (b) determining whether said compound binds to a 

Dkk interacting proteins or the Dkk-binding 
fragment thereof. 
In a preferred embodiment, the Dkk is Dkk-1. 

It is a further object of the invention to provide a method of screening for 
20 compounds or compositions which modulate the interaction of Dkk and a Dkk 
interacting protein comprising: 

(a) exposing Dkk interacting protein(s) or a Dkk- 
binding fragment thereof to a compounds; and, 

(b) determining whether said compound modulates 
25 the interaction of Dkk and Dkk interacting proteins. 

It is a further object of the invention to provide a method of screening for 
compounds or compositions which modulate the interaction of Dkk and a Dkk 
interacting protein comprising: 

(a) exposing a Dkk interacting proteins or a Dkk- 
30 binding fragment thereof to a compound; 
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(b) determining whether said compound binds to a 
Dkk interacting proteins or the Dkk-binding 
fragment thereof; and, 

(c) further determining whether said compound 
modulates the interaction of Dkk and Dkk 
interacting proteins. 

In a preferred embodiment, Dkk is Dkk-1 . 

In preferred embodiments of such methods, the Dkk interacting proteins, 
particularly Dkk-1 interacting proteins, or a Dkk-binding fragment thereof are 
attached to a solid substrate. Another aspect of the invention provides for 
compounds and compositions identified by the disclosed methods. A preferred 
embodiment provides that the compound comprises a Dkk or Dkk-1 peptide 
aptamer, such as those depicted in Figure 3 (SEQ ID NOs:171-188), or a mimetic of 
such aptamers. The compound may also comprise a peptide aptamer of a Dkk 
interacting or Dkk-1 interacting protein, or a mimetic of such an aptamer. The 
compound may also comprise an antibody to a Dkk interacting or Dkk-1 interacting 
protein. 

It is another object of the invention to provide for a composition for treating 
bone mass disorders comprising a LRP5/LRP6/HBM modulating compound and a 
pharmaceutical^ acceptable excipient and/or carrier therefor. Preferred LRP5 (or 
LRP6 or HBM) modulating compounds include Dkk or Dkk-1 or a LRP5/LRP6/HBM 
binding fragment thereof. Also contemplated are compounds which comprise 
monoclonal or polyclonal antibodies or immunologically active fragments thereof 
which bind to Dkk, including Dkk-1, and a pharmaceutical^ acceptable excipient 
and/or carrier. Another preferred embodiment provides that the modulating 
compound comprises one or more Dkk interacting or Dkk-1 interacting proteins, or a 
biologically active fragment thereof. Also contemplated are compounds which 
comprise monoclonal or polyclonal antibodies or immunologically active fragments 
thereof which bind to Dkk interacting or Dkk-1 interacting proteins, or a biologically 
active fragment thereof, and a pharmaceutical^ acceptable excipient and/or carrier. 
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Another preferred embodiment provides that the modulating compound comprises 
an antisense, siRNA, and shRNA molecule which recognizes and binds to a nucleic 
acid encoding a Dkk interacting or Dkk-1 interacting protein. Another preferred 
embodiment provides that the modulating compound comprises a Dkk or Dkk-1 
peptide aptamer, a mimetic of a Dkk or Dkk-1 peptide aptamer, a peptide aptamer of 
a Dkk interacting or Dkk-1 interacting protein, or a mimetic of such an aptamer. 
Another embodiment provides that the compound comprises an LRP5 peptide 
aptamer, such as OST262 (SEQ ID NO:208), Figure 4 (SEQ ID NOs:189-192) 
(particularly peptide) and Figure 13 (including SEQ ID NOs:204-214), or a mimetic of 
such an aptamer. Preferred compounds of the present invention also comprise 
LRP5 antibodies. 

It is a further object of the invention to provide a pharmaceutical composition 
for treating a Dkk-mediated disease or condition comprising a compound which 
modulates Dkk activity and a carrier therefor, including pharmaceutical^ acceptable 
excipients. Such compositions include those wherein the compound comprises an 
antisense, siRNA, and shRNA molecule or an antibody which binds to Dkk, including 
Dkk-1 , and thereby prevents it from interacting with LRP5, LRP6, or HBM. Other 
such compositions include one or more of Dkk interacting or Dkk-1 interacting 
proteins, such as those depicted in Figure 5, or a Dkk-binding fragment thereof, or a 
monoclonal or polyclonal antibody, or immunologically active fragment thereof, which 
binds to a Dkk interacting or Dkk-1 interacting protein or Dkk-binding fragment 
thereof. Other contemplated compositions include antisense, siRNA, and shRNA 
molecules which recognize and bind to a nucleic acid encoding a Dkk interacting or 
Dkk-1 interacting protein. Further contemplated compositions include Dkk and Dkk-1 
peptide aptamers, such as those depicted in Figure 3 (SEQ ID NOs;171-188), 
mimetics of such aptamers, a peptide aptamer of a Dkk interacting or Dkk-1 
interacting protein, or a mimetic of such an aptamer. Other contemplated 
compositions comprise an LRP5 peptide aptamer, such as OST262 (SEQ ID 
NO:208), Figure 4 (SEQ ID NOs:189-192) (particularly peptide 13 (SEQ ID NO:191)) 
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and Figure 13 (including SEQ ID NO:204-214), or a mimetic of such an aptamer. 
Other preferred compositions of the present invention comprise LRP5 antibodies. 

A further object of the invention to provide for a method of modulating the 
expression of a nucleic acid encoding a Dkk interacting or Dkk-1 interacting protein 
5 in an organism, such as those shown in Figure 5, comprising the step of 

administering to the organism an effective amount of composition which modulates 
the expression of a nucleic acid encoding a Dkk-1 interacting protein. In a preferred 
embodiment, said composition comprises an antisense, siRNA, or shRNA molecule 
which recognizes and binds to a nucleic acid encoding a Dkk interacting or Dkk-1 

10 interacting protein. 

One aspect of the invention provides for a method of modulating at least one 
activity of Dkk or a Dkk-1 interacting protein comprising administering an effective 
amount of a composition which modulates at least one activity of Dkk or a Dkk-1 
interacting protein. The invention provides for a composition comprising a Dkk 

15 interacting or Dkk-1 interacting protein, such as those shown in Figure 5, or a 

biologically active fragment thereof. Other agents contemplated for this method are 
antisense, siRNA, or shRNA molecules which recognize and bind to a nucleic acid 
encoding a Dkk interacting or Dkk-1 interacting protein. The method further provides 
that the composition comprises a Dkk or Dkk-1 antibody or an antibody to a Dkk 

20 interacing or Dkk-1 interacting protein. In another preferred embodiment, the 

composition comprises a Dkk or Dkk-1 peptide aptamer, a mimetic of a Dkk or Dkk-1 
peptide aptamer, a peptide aptamer of a Dkk interacting or Dkk-1 interacting protein, 
or a mimetic of such an aptamer. The method provides that a composition of the 
present invention may comprise an LRP5 peptide aptamer, such as OST262 (SEQ 

25 ID NO:208), Figure 4 (SEQ ID NO:189-192) (particularly peptide including (SEQ ID 
NO:191)) and Figure including (SEQ ID NOs:204-214), or a mimetic of such an 
aptamer. Preferred compositions of the present invention also comprise LRP5 
antibodies. In a further preferred embodiment, the modulated Dkk activity is lipid 
modulation or bone mass modulation. 
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In all of the testing/screening embodiments of the present invention discussed 
below to obtain compounds or compositions which ultimately impact 
LRP5/LRP6/HBM signaling, one skilled in the art will recognize that HBM can be 
used as a control in the absence of a test sample or compound. Further, the effect 
of a test sample of compound on Wnt signaling through the interaction of Dkk with 
LRP5/LRP6/HBM does not necessarily require a direct measurement of an 
association or interaction of Dkk and LRP5/LRP6/HBM. Other positive 
phenotypes/activities established by the High Bone Mass phenotype or by using 
HBM as a control. 

One aspect of the invention provides for a method of identifying binding 
partners for a Dkk protein comprising the steps of: 

(a) exposing the Dkk protein(s) or a LRP5/LRP6 binding fragment 
thereof to a potential binding partner; and 

(b) determining if the potential binding partner binds to a Dkk protein or 
the LRP5/LRP6 binding fragment thereof. 

In a preferred embodiment, the Dkk is Dkk-1. 

Another aspect of the invention is to provide for a method of identifying a 
compound that effects Dkk-mediated activity comprising 

(a) providing a group of transgenic animals having (1 ) 
a regulatable one or more Dkk interacting protein 
genes, (2) a knock-out of one or more Dkk 
interacting protein genes, or (3) a knock-in of one 
or more Dkk interacting protein genes; 

(b) providing a second group of control animals 
respectively for the group of transgenic animals in 
step (a); and 

(c) exposing the transgenic animal group and the 
control animal group to a potential Dkk-modulating 
compound which modulates bone mass or lipid 
levels; and 
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(d) comparing the transgenic animal group and the 
control animal group and determining the effect of 
the compound on bone mass or lipid levels in the 
transgenic animals as compared to the control 
animals. 

In a preferred embodiment, the Dkk is Dkk-1. 

It is another aspect of the invention to provide for a method for determining 
whether a compound modulates a Dkk interacting protein, said method comprising 
the steps of: 

(a) mixing the Dkk interacting protein or a Dkk-binding 
fragment thereof with the ligand binding domain of 
Dkk in the presence of said at least one 
compound; 

(b) measuring the amount of said binding domain of 
Dkk bound to said Dkk interacting protein or the 
Dkk-binding fragment thereof as compared to a 
control without said at least one compound; and 

(c) determining whether the compound reduces the 
amount of said binding domain of Dkk binding to 
said Dkk interacting protein or Dkk-binding 
fragment thereof. 

In a preferred embodiment, the Dkk is Dkk-1. 

In a preferred embodiment, the binding domain is attached to a solid 
substrate. The invention further provides for compounds identified by this method. 
In a preferred embodiment, the invention provides that the Dkk interacting or Dkk-1 
interacting protein is detected by antibodies. In another preferred embodiment, the 
solid substrate is a microarray. Another preferred embodiment provides that the 
ligand binding domain of Dkk and/or Dkk interacting protein is fused or conjugated to 
a peptide or protein. The invention also provides that the compounds include Dkk 
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and Dkk-1 peptide aptamers, mimetics of Dkk and Dkk-1 peptide aptamers, Dkk and 
Dkk-1 interacting proteins peptide aptamers, or mimetics of such aptamers. 

An aspect of the invention provides a composition comprising one or more 
polypeptide sequences of one or more Dkk-1 interacting proteins, or a biologically 
active fragment thereof, one or more Dkk proteins, or a biologically active fragment 
thereof, or LRP5/LRP6/HBM polypeptide sequences or a biologically active fragment 
thereof (for example, the ligand binding domain) and a pharmaceutical^ acceptable 
excipient and/or carrier. Another aspect of the invention provides that the 
composition comprises a Dkk or Dkk-1 antibody or an antibody to a Dkk interacting 
or Dkk-1 interacting protein and a pharmaceutically acceptable excipient. A 
composition of the present invention may comprise an LRP5 peptide aptamer, such 
as OST262 (SEQ ID NO:208), Figure 4 (SEQ ID NOs: 189-1 92) (particularly peptide 
13 (SEQ ID NO:191)) and Figure 13 (including SEQ ID NOs:204-214), or a mimetic 
of such an aptamer. A composition of the present invention may comprise a Dkk 
peptide aptamer, for example as shown in Figure 3 (SEQ ID NOs:171-188). 
Preferred compositions of the present invention also comprise LRP5 antibodies. 

Another aspect of the invention is to provide an antibody or immunologically 
active antibody fragment which recognizes and binds to a Dkk-1 amino acid 
sequence selected from the group consisting of: Asn34-His266 (SEQ ID NO:110), 
Asn34-Cys245 (SEQ ID NO:111), Asn34-I_ys1 82 (SEQ ID NO:112), Cys97-His266 
(SEQ ID NO:113), Val139-His266 (SEQ ID NO:114), Gly183-His266 (SEQ ID 
NO: 115), Cys97-Cys245 (SEQ ID NO: 116), or Val139-Cys245 (SEQ ID NO: 117) of 
human Dkk-1. Additional antibodies may bind to any of the sequences depicted in 
Figures 3 (SEQ ID NOs:171-188) and Figure 4 (SEQ ID NOs:189-192). Another 
aspect of the invention is to provide for polyclonal antibodies to one or more amino 
acid sequences: Peptide 1 -GNKYQTIDNYQPYPC (SEQ ID NO:118), Peptide 2 - 
LDGYSRRTTLSSKMYHTKGQEG (SEQ ID NO:119), Peptide 3 - 
RIQKDHHQASNSSRLHTCQRH (SEQ ID NO:120). Peptide 4 - RGEIEETITESFGND 
(SEQ ID NO:121), and Peptide 5 - EIFQRCYCGEGLSCRIQKD (SEQ ID NO: 122). 
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It is a further object of the invention to provide a nucleic acid encoding a Dkk 
protein, e.g. Dkk-1, a Dkk interacting or Dkk-1 interacting protein aptamer, or an 
LRP5 aptamer comprising a nucleic acid encoding a scaffold protein in-frame with 
the activation domain of Gal4 or LexA that is in-frame with a nucleic acid which 
encodes for a Dkk or Dkk-1 or Dkk interacting or Dkk-1 interacting protein amino acid 
sequence. Preferably the scaffold protein is thioredoxin (trxA), S1 nuclease from 
Staphylococcus or M13. Other preferable embodiments include Dkk-1 amino acid 
sequences selected from Figure 6. 

It is yet a further object of the invention to provide a composition comprising a 
polypeptide sequence of Figure 3 (SEQ ID NOs:171-188), Figure 4 (SEQ ID 
NO:189-192), or of Dkk-1 interacting proteins identified in Figure 5 and a 
pharmaceutical^ acceptable excipient and/or carrier. 

Another aspect of the invention includes a method of detecting the modulatory 
activity of a compound on the binding interaction of a first peptide and a second 
peptide of a peptide binding pair that bind through extracellular interaction in their 
natural environment, comprising: 

(i) culturing at least one eukaryotic cell, wherein the eukaryotic cell 
comprises; > 

a) a nucleotide sequence encoding a first heterologous 
fusion protein comprising the first peptide or a segment 
thereof joined to a DNA binding domain of a 
transcriptional activation protein; 

b) a nucleotide sequence encoding a second heterologous 
fusion protein comprising the second peptide or a 
segment thereof joined to a transcriptional activation 
domain of a transcriptional activation protein; 

wherein binding of the first peptide or segment thereof and the second 
peptide or segment thereof reconstitutes a transcriptional activation 
protein; and 
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c) a reporter element activated under positive transcriptional 
control of the reconstituted transcriptional activation 
protein, wherein expression of the reporter element 
produces a selected phenotype; 

(ii) incubating a compound with the eukaryotic cell under conditions 
suitable to detect the selected phenotype; and 

(iii) detecting the ability of the compound to affect the binding interaction 
of the peptide binding pair by determining whether the compound 
affects the expression of the reporter element which produces the 
selected phenotype; 

wherein (1) said first peptide is a Dkk peptide and said second peptide is a 
peptide selected from LRP5, HBM, LRP6, and the Dkk-binding portion of 
LRP5/LRP6/HBM or (2) said first peptide is a Dkk-interacting protein or the 
Dkk-binding fragment thereof, and said second peptide is a Dkk peptide. 
In one embodiment, the eukaryotic cell is a yeast cell. In a preferred 
embodiment, the yeast cell is Saccharomyces. In a particularly preferred 
embodiment, the Saccharomyces cell is Saccharomyces cerevisiae. The invention 
further provides that the compound may comprise a Dkk interacting or Dkk-1 
interacting protein, or a biologically active fragment thereof. In one embodiment, the 
Dkk interacting or Dkk-1 interacting protein, or a Dkk-binding fragment thereof, is 
added directly to the assay. In another embodiment, the Dkk interacting or Dkk-1 
interacting protein, or a Dkk-binding fragment thereof, is recombinantly expressed by 
the eukaryotic cell in addition to the first and second peptides. In a preferred 
embodiment the compound comprises a Dkk or Dkk-1 aptamer, a mimetic of a Dkk 
or Dkk-1 peptide aptamer, a Dkk interacting or Dkk-1 interacting protein aptamer, or 
a mimetic of a Dkk-1 interacting protein aptamer. Other preferred embodiments 
provide that the compound comprises an LRP5 peptide aptamer, such as OST262 
(SEQ ID NO:208), Figure 4 (SEQ ID NOs:189-192) (particularly peptide 13 (SEQ ID 
NO:191) and Figure 13 (including SEQ ID NOs:204-214), or a mimetic of such an 
aptamer. Alternatively, the present invention also provides that the compound may 
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comprise LRP5 antibodies or Dkk antibodies. In another embodiment, the yeast cell 
further comprises at least one endogenous nucleotide sequence selected from the 
group consisting of a nucleotide sequence encoding the DNA binding domain of a 
transcriptional activation protein, a nucleotide sequence encoding the transcriptional 
activation domain of a transcriptional activation protein, and a nucleotide sequence 
encoding the reporter element, wherein at least one of the endogenous nucleotide 
sequences is inactivated by mutation or deletion. In another embodiment, the 
peptide binding pair comprises a ligand and a receptor to which the ligand binds. In 
one embodiment, the transcriptional activation protein is Gal4, Gcn4, Hap1 , Adr1 , 
Swi5, Ste12, Mcm1, Yap1, Ace1, Ppr1, Arg81, Lac9, Qa1F, VP16, or a mammalian 
nuclear receptor. In another embodiment, at least one of the heterologous fusion 
proteins is expressed from an autonomously-replicating plasmid. In one 
embodiment, the DNA binding domain comprises a heterologous DNA-binding 
domain of a transcriptional activation protein. In a preferred embodiment, the DNA 
binding protein is selected from the group consisting of a mammalian steroid 
receptor and bacterial LexA protein. In another embodiment, the reporter element is 
selected from the group consisting of lacZ, a polynucleotide encoding luciferase, a 
polynucleotide encoding green fluorescent protein (GFP), and a polynucleotide 
encoding chloramphenicol acetyltransferase. In a particularly preferred embodiment, 
the reporter element is lacZ 

The invention further provides for a rescue screen for detecting the activity of 
a compound for modulating the binding interaction of a first peptide and a second 
peptide of a peptide binding pair, comprising: 

(i) culturing at least one yeast cell, wherein the yeast cell comprises; 

a) a nucleotide sequence encoding a first heterologous fusion 
protein comprising the first peptide or a segment thereof joined 
to a DNA binding domain of a transcriptional activation protein; 

b) a nucleotide sequence encoding a second heterologous 
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fusion protein comprising the second peptide or a segment 
thereof joined to a transcriptional activation domain of a 
transcriptional activation protein; 

wherein binding of the first peptide or segment thereof and the second 
5 peptide or segment thereof reconstitutes a transcriptional activation 

protein; and 

c) a reporter element activated under positive transcriptional 
control of the reconstituted transcriptional activation protein, 
wherein expression of the reporter gene prevents exhibition of a 
1 0 selected phenotype; 

(ii) incubating a compound with the yeast cell under conditions suitable to 
detect the selected phenotype; and 

(iii) detecting the ability of the compound to affect the binding interaction of 
the peptide binding pair by determining whether the compound affects 

15 the expression of the reporter element which prevents exhibition of the 

♦ 

selected phenotype, 
wherein said first peptide is a Dkk peptide and said second peptide is a 
peptide selected from LRP5, HBM, LRP6 and a Dkk-binding fragment of 
LRP5/LRP6/HBM. 

20 In a preferred embodiment, the invention provides that the yeast cell is 

Saccharomyces. In a particularly preferred embodiment, the Saccharomyces cell is 
Saccharomyces cerevisiae. In one embodiment, the compound comprises one or 
more Dkk interacting or Dkk-1 interacting proteins, or a Dkk-binding fragment 
thereof. Compounds used in the present invention may comprise an LRP5 peptide 

25 aptamer, such as OST262 (SEQ ID NO:208), Figure 4 (SEQ ID NOs: 189-1 92) 
(particularly peptide 13 (SEQ ID NO: 191)) and Figure 13 (including SEQ ID 
NOs:204-214), or a mimetic of such an aptamer. Alternatively, the compound may 
comprise LRP5 antibodies or Dkk antibodies. In another embodiment, the yeast cell 
further comprises at least one endogenous nucleotide sequence selected from the 

30 group consisting of a nucleotide sequence encoding the DNA binding domain of a 
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transcriptional activation protein, a nucleotide sequence encoding the transcriptional 
activation domain of a transcriptional activation protein, and a nucleotide sequence 
encoding the reporter gene, wherein at least one of the endogenous nucleotide 
sequences is inactivated by mutation or deletion. In another embodiment, the 
5 transcriptional activation protein is Gal4, Gcn4, Hap1, Adr1, Swi5, Ste12, Mcm1, 
Yap1, Ace1, Ppr1, Arg81, Lac9, Qa1F, VP16, or a mammalian nuclear receptor. In 
one embodiment, at least one of the heterologous fusion proteins is expressed from 
an autonomously-replicating plasmid. In another embodiment, the DNA binding 
domain is a heterologous DNA-binding domain of a transcriptional activation protein. 
10 The invention also provides for a rescue screen for detecting the modulatory 

activity of a compound on the binding interaction of a first peptide and a second 
peptide of a peptide binding pair, comprising: 



(i) culturing at least one yeast cell, wherein the yeast cell comprises; 



15 



a) a nucleotide sequence encoding a first heterologous fusion 

protein comprising the first peptide or a segment thereof joined 
to a DNA binding domain of a transcriptional activation protein; 



20 



b) a nucleotide sequence encoding a second heterologous 
fusion protein comprising the second peptide or a segment 
thereof joined to a transcriptional activation domain of a 
transcriptional activation protein; 



wherein binding of the first peptide or segment thereof and the second 



peptide or segment thereof reconstitutes a transcriptional activation 



protein; and 



25 



c) a reporter element activated under positive transcriptional 
control of the reconstituted transcriptional activation protein, 
wherein expression of the reporter element prevents exhibition 
of a selected phenotype; 



(ii) incubating a compound with the yeast cell under conditions suitable to 



detect the selected phenotype; and 
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(iii) detecting the ability of the compound to affect the binding interaction of 
the peptide binding pair by determining whether the compound affects 
the expression of the reporter element which prevents exhibition of the 
selected phenotype, 

wherein said first peptide is a Dkk interacting or Dkk-1 interacting protein 

peptide and said second peptide is a Dkk or Dkk-1 peptide. 

In a preferred embodiment of the rescue screen, the yeast cell is 
Saccharomyces. In a particularly preferred embodiment, the Saccharomyces cell is 
Saccharomyces cerevisiae. In another embodiment, the yeast cell further comprises 
at least one endogenous nucleotide sequence selected from the group consisting of 
a nucleotide sequence encoding the DNA binding domain of a transcriptional 
activation protein, a nucleotide sequence encoding the transcriptional activation 
domain of a transcriptional activation protein, and a nucleotide sequence encoding 
the reporter gene, wherein at least one of the endogenous nucleotide sequences is 
inactivated by mutation or deletion. In one embodiment, the transcriptional activation 
protein is Gal4, Gcn4, Hap1, Adr1, Swi5, Ste12, Mcm1, Yap1, Ace1, Ppr1, Arg81, 
Lac9, Qa1 F, VP16, or a mammalian nuclear receptor. In another embodiment of the 
rescue screen, at least one of the heterologous fusion proteins is expressed from an 
autonomously-replicating plasmid. In another embodiment, the DNA binding domain 
is a heterologous DNA-binding domain of a transcriptional activation protein. 

The invention also provides for a method for identifying potential compounds 
which modulate Dkk activity comprising: 

a) measuring the effect on binding of one or more Dkk interacting 
protein, or a Dkk-binding fragment thereof, with Dkk or a 
LRP5/LRP6/HBM binding fragment thereof in the presence and 
absence of a compound; and 

b) identifying as a potential Dkk modulatory compound a 
compound which modulates the binding between one or more 
Dkk interacting proteins or Dkk-binding fragment thereof and 
Dkk or LRP5/LRP6/HBM fragment thereof. 
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In a preferred embodiment, the Dkk is Dkk-1 . 

The invention further provides for any of the Dkk peptide aptamers of Figure 3 
(SEQ ID NOs:171-188). The invention also provides for any of the LRP peptide 
aptamers of Figure 4 (SEQ ID NOs:189-192). 

Another aspect of the invention provides for a method of identifying agents 
which modulate the interaction of Dkk with the Wnt signaling pathway comprising: 

(a) injecting mRNA encoding Dkk and an agent into a Xenopus 

b 

blastomere; 

(b) assessing axis duplication or analyzing marker gene expression; and 

(c) identifying agents which elicit changes in axis duplication or marker 
gene expression as agents which modulate the interaction of Dkk with the Wnt 
signaling pathway. Wherein the agent may be chosen from among mRNA encoding 
Dkk interacting proteins, fragments thereof, siRNA, shRNA, antisense nucleotides, 
and antibodies. In a preferred embodiment, Dkk is Dkk-1 . In a further embodiment, 
mRNA of HBM, LRP5/6, any Wnt (including Wnt1-Wnt19, particularly Wnt1 , Wnt3, 
Wnt3a, and Wnt10b), Wnt antagonist, or combination of these is co-injected into the 
Xenopus blastomere. In another embodiment, the marker gene analyzed could 
include Siamois, Xnr3, slug, Xbra, HNK-1, endodermin, Xlhbox8, BMP2, BMP4, 
XLRP6, EF-1,or ODC. 

The present invention provides for a method for identifying agents which 
modulate the interaction of Dkk with the Wnt signaling pathway comprising: 

(a) transfecting cells with constructs encoding Dkk and potential Dkk 
interacting proteins, mRNA fragments thereof, siRNA, shRNA, or 
antisense, antibodies to LRP5/HBM/LRP6/Dkk/Dkk-interacting protein; 

(b) assessing changes in expression of a reporter gene linked to a Wnt- 
responsive promoter; and, 

(c) identifying as a Dkk interacting protein any protein which alters reporter 
gene expression compared with cells transfected with a Dkk construct 
alone. In a further preferred embodiment, the cells may be HOB-03- 
CE6, HEK293, or U20S cells. 
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In alternative embodiments, the Wnt-responsive promoter is TCF or LEF. In 
other preferred embodiments, the cells are co-transfected with CMV beta- 
galactosidase or tk-Renilla. 

The present invention further provides for a LRP5/HBM monoclonal or 
polyclonal antibody to one or more peptides of amino acid sequences 
MYWTDWVETPRIE (SEQ ID NO: 123), MYWTDWGETPRIE (SEQ ID NO:124), 
KRTGG KRKE I LSA (SEQ ID NO:125), ERVEKTTGDKRTRIQGR (SEQ ID NO:126), 
or KQQCDSFPDCIDGSDE (SEQ ID NO:127). 

Additionally, the present invention provides a method for identifying 
compounds which modulate Dkk and LRP5/LRP6/HBM interactions comprising: 

(a) immobilizing LRP5/LRP6/HBM to a solid surface; and 

(b) treating the solid surface with a secreted Dkk protein or a secreted 
epitope-tagged Dkk and a test compound; and 

(c) determining whether the compound regulates binding between Dkk and 
LRP5/LRP6/HMB using antibodies to Dkk or the epitope tag or by 
directly measuring activity of an epitope tag. 

In one embodiment, the Dkk is Dkk-1. In a preferred embodiment, the epitope 
tag is alkaline phosphatase, histidine, myc, or a V5 tag. 

Another embodiment of the present invention provides for a method for 
identifying compounds which modulate Dkk and LRP5/LRP6/HBM interactions 
comprising: 

(a) creating an LRP5, LRP6, or HBM fluorescent fusion protein using a first 
fluorescent tag; 

(b) creating a Dkk fusion protein comprising a second fluorescent tag; 

(c) adding a test compound; and, 

(d) assessing changes in the ratio of fluorescent tag emissions using 
Fluorescence Resonance Energy Transfer (FRET) or Bioluminescent 
Resonance Energy Transfer (BRET) to determine whether the 
compound modulates Dkk and LRP5/LRP6/HBM interactions. 

In a preferred embodiment, the Dkk is Dkk-1. 
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The present invention also provides for a method of diagnosing low or high 
bone mass and/or low or high lipid levels in a subject comprising examining 
expression of Dkk, LRP5, LRP6, HBM or HBM-like variant in the subject and 
determining whether Dkk, LRP5, LRP6, or HBM or a HBM-like variant is over- or 
under-expressed to determine whether subject has (a) high or low bone mass and/or 
(b) high or low lipid levels. 

The invention further provides for a transgenic animal wherein Dkk is knocked 
out in a tissue-specific fashion. In a preferred embodiment, the Dkk is Dkk-1. In one 
preferred embodiment, the tissue specificity is bone tissue. In another preferred 
embodiment, the tissue specificity is liver or other tissues or cells involved in 
regulating lipid metabolism or cancer tissue. 

The present invention further provides a method of screening for compounds 
which modulate the interaction of Dkk with LRP5, LRP6, or HBM comprising: 

(a) exposing LRP5, LRP6, or HBM, or a Dkk-binding fragment of LRP5, 
LRP6, or HBM to a compound; and 

(b) determining whether said compound bound to LRP5, LRP6, or HBM or 
the Dkk-binding fragment of LRP5, LRP6, or HBM and further 
determining whether said compound modulates the interaction of Dkk 
and LRP5, LRP6, or HBM. 

In one embodiment, the Dkk is Dkk-1 . In a preferred embodiment, the 
compound comprises an LRP5 peptide aptamer. Other preferred compositions 
include the peptide aptamer, OST262 (SEQ ID NO:208), Figure 4 (SEQ ID NOs:189- 

4 

192) (particularly peptide 13 (SEQ ID NO: 191) and Figure 13 (including SEQ ID 
NOs:204-214), or a mimetic of such an aptamer, and an LRP5 antibody. 

The present invention also provides a method for identifying compounds 
which modulate Dkk and LRP5/LRP6/HBM interactions comprising: 

(a) immobilizing LRP5/LRP6/HBM to a solid surface; and 

(b) treating the solid surface with a secreted Dkk protein or a secreted 
epitope-tagged Dkk and a test compound; and 
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(c) determining whether the compound regulates binding between Dkk and 
LRP5/LRP6/HBM using antibodies to Dkk or the epitope tag or by 
directly measuring activity of an epitope tag. In a preferred 
embodiment, the epitope tag is alkaline phosphatase, histidine, myc or 
a V5 tag. 

In a preferred embodiment, the Dkk is Dkk-1. 

The invention also provides for a method for identifying compounds which 
modulate the interaction of Dkk with the Wnt signaling pathway comprising: 

(a) transfecting cells with constructs containing Dkk and Wnt proteins; 

(b) assessing changes in expression of a reporter element linked to a Wnt- 
responsive promoter; and 

(c) identifying as a Dkk/Wnt interaction modulating compound any 
compound which alters reporter gene expression compared with cells 
transfected with a Dkk construct alone. 

In one embodiment, the Dkk is Dkk-1. In another embodiment, the Wnt is any 
of Wnt1-Wnt19. In a preferred embodiment, the Wnt is Wnt1 , Wnt3, Wnt3a, or 
Wnt10b. In a particularly preferred embodiment, the Wnt construct contains Wnt3a. 
In another particularly preferred embodiment, the Wnt construct contains Wnt1 . In 
another preferred embodiment, the Wnt construct encodes for a Wnt that signals 
through the canonical Wnt pathway. In a particularly preferred embodiment, both 
Wnt3a and Wnt1 constructs are co-transfected into the cells. In another 
embodiment, the cells may be U2-OS, HOB-03-CE6, or HEK293 cells. In another 
embodiment, the reporter element used is TCF-luciferase, tk-Renilla, or a 
combination thereof. 

The invention also provides for a method of testing compounds that modulate 
Dkk-mediated activity in a mammal comprising: 

(a) providing a group of transgenic animals having (1 ) a regulatable one or 
more Dkk genes, (2) a knock-out of Dkk genes, or (3) a knock-in of one 
or more Dkk genes; 
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(b) providing a second group of control animals respectively for the group 
of transgenic animals in step (a); and 

(c) exposing the transgenic animal group and control animal group to a 
potential Dkk-modulating compound which modulates bone mass or 
lipid levels; and 

(d) comparing the transgenic animals and the control group of animals and 
determining the effect of the compound on bone mass or lipid levels in 
the transgenic animals as compared to the control animals. 

In a preferred embodiment, the Dkk is Dkk-1 . 

The invention further provides variants of LRP5 which demonstrate HBM 
biological activity, i.e., that are "HBM-like." In preferred embodiments, variants 
G171F, M282V, G171K, G171Q, A65V, G171V, G171I, and A214V of LRP5 are 
provided. The invention further provides for the use any of these variants in the 
forgoing methods. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows a schematic of the components of the Wnt signal transduction 
pathway. Schematic obtained from: 
http://www.stanford.edu/-rnusse/pathways/cell2.html 

Figure 2 (A-C) show bait sequences (SEQ ID NOs:168-170) utilized in yeast 
two hybrid (Y2H) screens for protein-protein interactions. 

Figure 3 shows a table of peptide aptamer insert sequences (SEQ ID NOs: 
171-192) identified in Y2H screen with a Dkk-1 bait sequence. 

Figure 4 shows a table of peptide aptamer insert sequences identified in a 
Y2H screen using a LRP5 ligand binding domain bait sequence. 

Figure 5 shows a table of proteins identified in a Y2H screen using a Dkk-1 
bait sequence. These proteins are identified by both their nucleic acid and amino 
acid accession numbers. 

Figure 6 shows the results of a minimum interaction domain mapping screen 
of Dkk-1 with LRP5. At the top, a map of Dkk-1 showing the location of the signal 
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sequence, and cysteine rich domains 1 and 2. Below, the extent of domains 
examined using LRP5 LBD baits, LBD1 and LBD4, of Figure 2. To the right, scoring 
of the binding results observed in the experiment. 

Figure 7 shows a diagram of the Xenopus Embryo Assay for Wnt activity. 

Figure 8 shows the effects of Zmax/LRP5 and HBM on Wnt signaling in the 
Xenopus embryo assay. 

Figure 9 shows the effects of Zmax/LRP5 and HBM on induction of secondary 
axis formation in the Xenopus embryo assay. 

Figure 10 shows the effects of human Dkk-1 on the repression of the 
canonical Wnt pathway. 

Figure 11 shows the effects of human Dkk-1 on Zmax/LRP5 and HBM- 
mediated Wnt signaling. 

Figure 12 shows pcDNA3.1 construct names with nucleotide sequences 
(including SEQ ID NOs:1 93-203) for LRP5-binding peptide aptamers, Dkk-1 peptides 
and control constructs. 

Figure 13 shows the amino acid sequences (including SEQ ID NOs:204-214) 
for the corresponding LRP5-binding peptides, Dkk-1 peptide aptamers and control 
constructs in Figure 12. 

Figure 14 shows the effects of Dkk-1 and Dkk-2 on Wnt1 signaling with 
coreceptors LRP5, HBM, and LRP6 in HOB03CE6 cells. 

Figure 15 shows the effects of Dkk-1 and Dkk-2 on Wnt3a signaling with 
coreceptors LRP5, HBM, and LRP6 in HOB03CE6 cells. 

Figure 16 demonstrates that the LRP5-LBD peptide aptamer 262 activates 
Wnt signaling in the presence of Wnt3a in U20S cells. 

Figure 17 shows the differential binding of an antibody generated to a 
sequence (a.a. 165-177) containing the HBM mutation in LRP5 in LRP5 and HBM 
virus-infected cells. 

Figure 18 shows data generated from a Y2H interaction trap where a mutant 
Dkk-1 (C220A) is unable to bind to LRP5 and demonstrating the window of capability 
of detecting small molecule effects on LRP and Dkk interactions. 
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Figure 19 shows that Dkk-1 represses Wnt3a-mediated Wnt signaling in 
U20S bone cells using the cell-based reporter gene assay for high throughput 
screening. 

Figure 20 demonstrates that Wnt1-HBM generated signaling is not efficiently 
5 inhibited by Dkk-1 in U20S bone cells while LRP5 and LRP6-mediated signaling are 
using the cell-based reporter gene assay for high throughput screening. 

Figure 21 shows that the TCF signal in the cell-based reporter gene assay for 
high throughput screening can be modulated by Dkk-1 and Dkk-1 -AP without Wnt 
DNAtransfection. 

10 Figure 22 shows the morphological results in the Xenopus assay using 

aptamers 261 and 262 from the LRP5-LBD to activate Wnt signaling. 

Figure 23 demonstrates that LRP5-LBD aptamers 261 and 262 induce Wnt 
signaling over other LRP5 aptamers. 

Figure 24 shows that the mutation G171F in LRP5 produces a greater 
15 activation of the Wnt pathway than LRP5 which is consistent with HBM activity. 

Figure 25 shows that the mutation M282V in LRP5 produces an activation of 
the Wnt pathway which is consistent with HBM activity in U20S cells. 

Figure 26 shows the amino acid sequence of the various peptides of dkk-1 
selected to generate polyclonal antibodies, their relationship to the Dkk-1 amino acid 
20 sequence and identities of polyclonal antibodies generated. 

Figure 27 shows a Western blot demonstrating that polyclonal antibody #5521 
to amino acids 165-186 of Dkk-1 was able to detect Dkk1-V5 and Dkk1-AP from 
conditioned medium. 

Figure 28 shows a Western blot demonstrating that polyclonal antibody 
25 #74397 to amino acids 147-161 was able to detect Dkk1-V5 in both conditioned 
medium and immunoprecipitated conditioned medium. 
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DETAILED DESCRIPTION OF THF INVENTION 
1. Definitions 

In general, terms in the present application are used consistent with the 
manner in which those terms are understood in the art. To aid in the understanding 
of the specification and claims, the following definitions are provided. 

"Gene" refers to a DNA sequence that encodes through its template or 
messenger RNA a sequence of amino acids characteristic of a specific peptide. The 
term "gene" includes intervening, non-coding regions, as well as regulatory regions, 
and can include 5' and 3' ends. 

By "nucleic acid" is meant to include single stranded and double stranded 
nucleic acids including, but not limited to DNAs, RNAs (e.g., mRNA, tRNAs, 
siRNAs), cDNAs, recombinant DNA (rDNA), rRNAs, antisense nucleic acids, 
oligonucleotides, and oligomers, and polynucleotides. The term may also include 
hybrids such as triple stranded regions of RNA and/or DNA or double stranded 
RNA:DNA hybrids. The term also is contemplated to include modified nucleic acids 
such as, but not limited to biotinylated nucleic acids, tritylated nucleic acids, 
fluorophor labeled nucleic acids, inosine, and the like. 

"Gene sequence" refers to a. nucleic acid molecule, including DNA which 
contains a non-transcribed or non-translated sequence, which comprises a gene. 
The term is also intended to include any combination of gene(s), gene fragment(s), 
non-transcribed sequence(s) or non-translated sequence(s) which are present on the 
same DNA molecule. 

The nucleic acid sequences of the present invention may be derived from a 
variety of sources including DNA, cDNA, synthetic DNA, synthetic RNA or 
combinations thereof. Such sequences may comprise genomic DNA which may or 
may not include naturally occurring introns. Moreover, such genomic DNA may be 
obtained in association with promoter regions and/or poly (A) sequences. The 
sequences, genomic DNA or cDNA may be obtained in any of several ways. 
Genomic DNA can be extracted and purified from suitable cells by means well 
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known in the art. Alternatively, mRNA can be isolated from a cell and used to 
produce cDNA by reverse transcription or other means. 

"cDNA" refers to complementary or copy DNA produced from an RNA 
template by the action of RNA-dependent DNA polymerase (reverse transcriptase). 
5 Thus, a "cDNA clone" means a duplex DNA sequence for which one strand is 

complementary to an RNA molecule of interest, carried in a cloning vector or PCR 
amplified. cDNA can also be single stranded after first strand synthesis by reverse 
transcriptase. In this form, it is a useful PCR template and does not need to be 
carried in a cloning vector. This term includes genes from which the intervening 
10 sequences have been removed. Thus, the term "gene", as sometimes used 

generically, can also include nucleic acid molecules comprising cDNA and cDNA 
clones. 

"Recombinant DNA" means a molecule that has been engineered by splicing 
in vitro a cDNA or genomic DNA sequence or altering a sequence by methods such 
15 as PCR mutagenesis. 

"Cloning" refers to the use of in vitro recombination techniques to insert a 
particular gene or other DNA sequence into a vector molecule. In order to 
successfully clone a desired gene, it is necessary to use methods for generating 
DNA fragments, for joining the fragments to vector molecules, for introducing the 
20 composite DNA molecule into a host cell in which it can replicate, and for selecting 
the clone having the target gene from amongst the recipient host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules containing 
cDNA inserts which together comprise the entire or a partial repertoire of genes 
expressed in a particular tissue or cell source. Such a cDNA library can be prepared 
25 by methods known to one skilled in the art and described by, for example, Cowell 
and Austin, "cDNA Library Protocols," Methods in Molecular Biology (1997). 

"Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence 
which is able to replicate in a host cell. This term can also include artificial 
chromosomes such as BACs and YACs. The cloning vehicle is characterized by one 
30 or more endonuclease recognition sites at which such DNA sequences may be cut in 
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a determinable fashion without loss of an essential biological function of the DNA t 
which may contain a marker suitable for use in the identification of transformed cells. 

"Expression" refers to the process comprising transcription of a gene 
sequence and subsequent processing steps, such as translation of a resultant 
mRNA to produce the final end product of a gene. The end product may be a 
protein (such as an enzyme or receptor) or a nucleic acid (such as a tRNA, 
antisense RNA, or other regulatory factor). The term "expression control sequence" 
refers to a sequence of nucleotides that control or regulate expression of structural 
genes when operably linked to those genes. These include, for example, the lac 
systems, the trp system, major operator and promoter regions of the phage lambda, 
the control region of fd coat protein and other sequences known to control the 
expression of genes in prokaryotic or eukaryotic cells. Expression control 
sequences will vary depending on whether the vector is designed to express the 
operably linked gene in a prokaryotic or eukaryotic host, and may contain 
transcriptional elements such as enhancer elements, termination sequences, tissue- 
specificity elements and/or translational initiation and termination sites. 

"Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle 
but which is capable of expressing a gene which has been cloned into it, after 
transformation into a host. The cloned gene is usually placed under the control of 
(i.e., operably linked to) an expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 
repressor, thereby controlling the transcription of adjacent gene(s). 

"Promoter" refers to a DNA sequence that can be recognized by an RNA 
polymerase. The presence of such a sequence permits the RNA polymerase to bind 
and initiate transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene 
sequences which may be necessary for the initiation of transcription. The presence 
of a promoter region is sufficient to cause the expression of an operably linked gene 
sequence. The term "promoter" is sometimes used in the art to generically indicate 
a promoter region. Many different promoters are known in the art which direct 
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expression of a gene in a certain cell types. Tissue-specific promoters can comprise 
nucleic acid sequences which cause a greater (or decreased) level of expression in 
cells of a certain tissue type. 

"Operably linked" means that the promoter controls the initiation of expression 
5 of the gene. A promoter is operably linked to a sequence of proximal DNA if upon 
introduction into a host cell the promoter determines the transcription of the proximal 
DNA sequence(s) into one or more species of RNA. A promoter is operably linked to 
a DNA sequence if the promoter is capable of initiating transcription of that DNA 
sequence. 

10 "Prokaryote" refers to all organisms without a true nucleus, including bacteria. 

"Eukaryote" refers to organisms and cells that have a true nucleus, including 
mammalian cells. 

"Host" includes prokaryotes and eukaryotes, such as yeast and filamentous 
fungi, as well as plant and animal cells. The term includes an organism or cell that is 

1 5 the recipient of a replicable expression vehicle. 

The term "animal" is used herein to include all vertebrate animals, except 
humans. It also includes an individual animal in all stages of development, including 
embryonic and fetal stages. Preferred animals include higher eukaryotes such as 
avians, rodents (e.g., mice, rabbits, rats, chinchillas, guinea pigs, hamsters and the 

20 like), and mammals. Preferred mammals include bovine, equine, feline, canine, 
ovine, caprine, porcine, buffalo, humans, and primates. 

A "transgenic animal" is an animal containing one or more cells bearing 
genetic information received, directly or indirectly, by deliberate genetic manipulation 
or by inheritance from a manipulated progenitor at a subcellular level, such as by 

25 microinjection or infection with a recombinant viral vector (e.g., adenovirus, 

retrovirus, herpes virus, adeno-associated virus, lentivirus). This introduced DNA 
molecule may be integrated within a chromosome, or it may be extra-chromosomally 
replicating DNA. 

"Embryonic stem cells" or "ES cells" as used herein are cells or cell lines 
30 usually derived from embryos which are pluripotent meaning that they are un- 
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differentiated cells. These cells are also capable of incorporating exogenous DNA 
by homologous recombination and subsequently developing into any tissue in the 
body when incorporated into a host embryo. It is possible to isolate pluripotent cells 
from sources other than embryonic tissue by methods which are well understood in 
the art. 

Embryonic stem cells in mice have enabled researchers to select for 
transgenic cells and perform gene targeting. This allows more genetic engineering 
than is possible with other transgenic techniques. For example, mouse ES cells are 
relatively easy to grow as colonies in vitro. The cells can be transfected by standard 
procedures and transgenic cells clonally selected by antibiotic resistance. See, for 
example, Doetschman et a/.., 1994, Gene transfer in embryonic stem cells. In 
Pinkert (Ed.) Transgenic Animal Technology: A Laboratory Handbook . Academic 
Press Inc., New York, pp.1 15-1 46. Furthermore, the efficiency of this process is 
such that sufficient transgenic colonies (hundreds to thousands) can be produced to 
allow a second selection for homologous recombinants. Mouse ES cells can then be 
combined with a normal host embryo and, because they retain their potency, can 
develop into all the tissues in the resulting chimeric animal, including the germ cells. 
The transgenic modification can then be transmitted to subsequent generations. 

Methods for deriving embryonic stem (ES) cell lines in vitro from early 
preimplantation mouse embryos are well known. See for example, Evans et a/., 
1981 Nature 29: 154-6 and Martin, 1981, Proc. Nat Acad. Sci. USA, 78: 7634-8. ES 
cells can be passaged in an undifferentiated state, provided that a feeder layer of 
fibroblast cells or a differentiation inhibiting source is present. 

The term "somatic cell" indicates any animal or human cell which is not a 
sperm or egg cell or is capable of becoming a sperm or egg cell. The term "germ 
cell" or "germ-line cell" refers to any cell which is either a sperm or egg cell or is 
capable of developing into a sperm or egg cell and can therefore pass its genetic 
information to offspring. The term "germ cell-line transgenic animal" refers to a 
transgenic animal in which the genetic information was incorporated in a germ line 
cell, thereby conferring the ability to transfer the information to offspring. If such 
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offspring in fact possess some or all of that information, then they, too, are 
transgenic animals. 

The genetic alteration of genetic information may be foreign to the species of 
animal to which the recipient belongs, or foreign only to the particular individual 
recipient. In the last case, the altered or introduced gene may be expressed 
differently than the native gene. 

"Fragment" of a gene refers to any portion of a gene sequence. A 
"biologically active fragment" refers to any portion of the gene that retains at least 
one biological activity of that gene. For example, the fragment can perhaps 
hybridize to its cognate sequence or is capable of being translated into a polypeptide 
fragment encoded by the gene from which it is derived. 

"Variant" refers to a gene that is substantially similar in structure and 
biological activity or immunological characteristics to either the entire gene or to a 
fragment of the gene. Provided that the two genes possess a similar activity, they 
are considered variant as that term is used herein even if the sequence of encoded 
amino acid residues is not identical. Preferentially, as used herein (unless otherwise 
defined) the variant is one of LRP5, HBM or LRP6. The variant preferably is one that 
yields an HBM-like phenotype (i.e., enhances bones mass and/or modulates lipid 
levels). These variants include missense mutations, single nucleotide 
polymorphisms (SNPs), mutations which result in changes in the amino acid 
sequence of the protein encoded by the gene or nucleic acid, and combinations 

* 

thereof, as well as com in the exon domains of the HBM gene and mutations in 
LRP5 or LRP6 which result in an HBM like phenotype. 

"Amplification of nucleic acids" refers to methods such as polymerase chain 
reaction (PGR), ligation amplification (or ligase chain reaction, LCR) and 
amplification methods based on the use of Q-beta replicase. These methods are 
well known in the art and described, for example, in U.S. Patent Nos. 4,683,195 and 
4,683,202. Reagents and hardware for conducting PCR are commercially available. 
Primers useful for amplifying sequences from the HBM region are preferably 
complementary to, and hybridize specifically to sequences in the HBM region or in 
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regions that flank a target region therein. HBM sequences generated by 
amplification may be sequenced directly. Alternatively, the amplified sequence(s) 
may be cloned prior to sequence analysis. 

"Antibodies" may refer to polyclonal and/or monoclonal antibodies and 
fragments thereof, and immunologic binding equivalents thereof, that can bind to the 
HBM proteins and fragments thereof or to nucleic acid sequences from the HBM 
region, particularly from the HBM locus or a portion thereof. Preferred antibodies 
also include those capable of binding to LRP5, LRP6 and HBM variants. The term 
antibody is used both to refer to a homogeneous molecular entity, or a mixture such 
as a serum product made up of a plurality of different molecular entities. Proteins 
may be prepared synthetically in a protein synthesizer and coupled to a carrier 
molecule and injected over several months into rabbits. Rabbit sera is tested for 
immunoreactivity to the HBM protein or fragment. Monoclonal antibodies may be 
made by injecting mice with the proteins, or fragments thereof. Monoclonal 
antibodies will be screened by ELISA and tested for specific immunoreactivity with 
HBM protein or fragments thereof. Harlow et a/., Antibodies: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988) and Using 
Antibodies: A Laboratory Manual, Harlow, Ed and Lane, David (Cold Spring Harbor 
Press, 1999). These antibodies will be useful in assays as well as pharmaceuticals. 
By "antibody" is meant to include but not limited to polyclonal, monoclonal, chimeric, 
human, humanized, bispecific, multispecific, primatized™ antibodies. 

"HBM protein" refers to a protein that is identical to a Zmaxl (LRP5) protein 
except that it contains an alteration of glycine 171 to a valine. An HBM protein is 
defined for any organism that encodes a Zmaxl (LRP5) true homolog. For example; 
a mouse HBM protein refers to the mouse Zmaxl (LRP5) protein having the glycine 
170 to valine substitution. 

By "HBM-like" is meant a variant of LRP5, LRP6 or HBM which when 
expressed in a cell is capable of modulating bone mass, lipid levels, Dkk activity, 
and/or Wnt activity. 
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In one embodiment of the present invention, "HBM gene" refers to the 
genomic DNA sequence found in individuals showing the HBM characteristic or 
phenotype, where the sequence encodes the protein indicated by SEQ ID NO: 4. 
The HBM gene and the Zmaxl {LRP5) gene are allelic. The protein encoded by the 
HBM gene has the property of causing elevated bone mass, while the protein 
encoded by the Zmaxl (LRP5) gene does not. The HBM gene and the Zmaxl 
(LRP5) gene differ in that the HBM gene has a thymine at position 582, while the 
Zmaxl gene has a guanine at position 582. The HBM gene comprises the nucleic 
acid sequence shown as SEQ ID NO: 2. The HBM gene may also be referred to as 
an "HBM polymorphism." Other HBM genes may further have silent mutations, such 
as those discussed in Section 3 below. 

In alternative embodiments of the present invention, "HBM gene" may also 
refer to any allelic variant of Zmaxl (LRP5) or LRP6 which results in the HBM 
phenotype. Such variants may include alteration from the wild-type protein coding 
sequence as described herein and/or alteration in expression control sequences of 
Zmaxl (LRP5) or contains an amino acid mutation in LRP5 or LRP6, such that the 
resulting protein produces a phenotype which enhances bone mass and/or 
modulates lipid levels. A preferred example of such a variant is an alteration of the 
endogenous Zmaxl (LRP5) promoter region resulting in increased expression of the 
Zmaxl (LRP5) protein. 

"Normal," "wild-type," "unaffected", "Zmaxl", "Zmax", "LR3" and "LRP5" all 
refer to the genomic DNA sequence that encodes the protein indicated by SEQ ID 
NO: 3. LRP5 has also been referred to LRP7 in mouse. Zmaxl , LRP5 and Zmax 
may be used interchangeably throughout the specification and are meant to be the 
same gene, perhaps only relating to the gene in a different organism. The Zmaxl 
gene has a guanine at position 582 in the human sequence. The Zmaxl gene of 
human comprises the nucleic acid sequence shown as SEQ ID NO: 1. "Normal," 
"wild-type," "unaffected", "Zmaxl" and "LRP5" also refer to allelic variants of the 
genomic sequence that encodes proteins that do not contribute to elevated bone 
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mass. The Zmaxl (LRP5) gene is common in the human population, while the HBM 
gene is rare. 

"Bone development" generally refers to any process involved in the change of 
bone over time, including, for example, normal development, changes that occur 
during disease states, and changes that occur during aging. This may refer to 
structural changes and dynamic rate changes such as growth rates, resorption rates, 
bone repair rates, and etc. "Bone development disorder" particularly refers to any 
disorders in bone development including, for example, changes that occur during 
disease states and changes that occur during aging. Bone development may be 
progressive or cyclical in nature. Aspects of bone that may change during 
development include, for example, mineralization, formation of specific anatomical 
features, and relative or absolute numbers of various cell types. 

"Bone modulation" or "modulation of bone formation" refers to the ability to 
affect any of the physiological processes involved in bone remodeling, as will be 
appreciated by one skilled in the art, including, for example, bone resorption and 
appositional bone growth, by, inter alia, osteoclastic and osteoblastic activity, and 
may comprise some or all of bone formation and development as used herein. 

Bone is a dynamic tissue that is continually adapting and renewing itself 
through the renewal of old or unnecessary bone by osteoclasts and the rebuilding of 
new bone by osteoblasts. The nature of the coupling between these processes is 
responsible for both the modeling of bone during growth as well as the maintenance 
of adult skeletal integrity through remodeling and repair to meet the everyday needs 
of mechanical usage. There are a number of diseases that result from an 
uncoupling of the balance between bone resorption and formation. With aging there 
is a gradual "physiologic" imbalance in bone turnover, which is particularly 
exacerbated in women due to menopausal loss of estrogen support, that leads to a 
progressive loss of bone. As bone mineral density falls below population norms 
there is a consequent increase in bone fragility and susceptibility to spontaneous 
fractures. For every 1 0 percent of bone that is lost, the risk of fracture doubles. 
Individuals with bone mineral density (BMD) in the spine or proximal femur 2.5 or 
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more standard deviations below normal peak bone mass are classified as 
osteoporotic. However, osteopenic individuals with BMD between 1 and 2.5 
standard deviations below the norm are clearly at risk. 

Bone is measured by several different forms of X-ray absorptiometry. All of 
the instruments measure the inorganic or bone mineral content of the bone. 
Standard DXA measurements give a value that is an areal density, not a true density 
measurement by the classical definition of density (mass/unit volume). 
Nevertheless, this is the type of measurement used clinically to diagnose 
osteoporosis. However, while BMD is a major contributing factor to bone strength, 
as much as 40% of bone strength stems from other factors including: 1) bone size 
(i.e., larger diameters increase organ-level stiffness, even in the face of lower 
density); 2) the connectivity of trabecular structures; 3) the level of remodeling 
(remodeling loci are local concentrators of strain); and 4) the intrinsic strength of the 
bony material itself, which in turn is a function of loading history (i.e., through 
accumulated fatigue damage) and the extent of collagen cross-linking and level of 
mineralization. There is good evidence that all of these strength/fragility factors play 
some role in osteoporotic fractures, as do a host of extraskeletal influences as well 
(such as fall patterns, soft tissue padding, and central nervous system reflex 
responsiveness). 

Additional analytical instruments can be used to address these features of 
bone. For example, the pQCT allows measurement of separate trabecular and 
cortical compartments for size and density and the /^CT provides quantitative 
information on architectural features such as trabecular connectivity. The /xCJ also 
gives a true bone density measurement. With these tools, the important non-BMD 
parameters can be measured for diagnosing the extent of disease and the efficacy of 
treatments. Current treatments for osteoporosis are based on the ability of drugs to 
prevent or retard bone resorption. Although newer anti-resorptive agents are proving 
to be useful in the therapy of osteoporosis, they are viewed as short-term solutions 
to the more definitive challenge to develop treatments that will increase bone mass 
and/or the bone quality parameters mentioned above. 
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Thus, bone modulation may be assessed by measuring parameters such as 
bone mineral density (BMD) and bone mineral content (BMC) by pDXA X-ray 
methods, bone size, thickness or volume as measured by X-ray, bone formation 
rates as measured for example by calcien labeling, total, trabecular, and mid-shaft 
density as measured by pQCT and/or ^Cl methods, connectivity and other 
histological parameters as measured by fxCT methods, mechanical bending and 
compressive strengths as preferably measured in femur and vertebrae respectively. 
Due to the nature of these measurements, each may be more or less appropriate for 
a given situation as the skilled practitioner will appreciate. Furthermore, parameters 
and methodologies such as a clinical history of freedom from fracture, bone shape, 
bone morphology, connectivity, normal histology, fracture repair rates, and other 
bone quality parameters are known and used in the art. Most preferably, bone 
quality may be assessed by the compressive strength of vertebra when such a 
measurement is appropriate. Bone modulation may also be assessed by rates of 
change in the various parameters. Most preferably, bone modulation is assessed at 
more than one age. 

"Normal bone density" refers to a bone density within two standard deviations 
of a Z score of 0 in the context of the HBM linkage study. In a general context, the 
range of normal bone density parameters is determined by routine statistical 
methods. A normal parameter is within about 1 or 2 standard deviations of the age 
and sex normalized parameter, preferably about 2 standard deviations. A statistical 
measure of meaningfulness is the P value which can represent the likelihood that the 
associated measurement is significantly different from the mean. Significant P 
values are P < 0.05, 0.01 , 0.005, and 0.001 , preferably at least P < 0.01 . 

"HBM" refers to "high bone mass" although this term may also be expressed 
in terms of bone density, mineral content, and size. 

The "HBM phenotype" and "HBM-like phenotype" may be characterized by an 
increase of about 2 or more standard deviations, preferably 2, 2.5, 3, or more 
standard deviations in 1 , 2, 3, 4, 5, or more quantitative parameters of bone 
modulation, preferably bone density and mineral content and bone strength 



46 



WO 02/092015 



PCT/US02/15982 



parameters, above the age and sex norm for that parameter. The HBM phenotype 
and HBM-like phenotype are characterized by statistically significant increases in at 
least one parameter, preferably at least 2 parameters, and more preferably at least 3 
or more parameters. The HBM phenotype and the HBM-like phenotype may also be 

• 5 characterized by an increase in one or more bone quality parameters and most 

preferably increasing parameters are not accompanied by a decrease in any bone 
quality parameters. Most preferably, an increase in bone modulation parameters 
and/or bone quality measurements is observed at more than one age. The HBM 
phenotype and HBM-like phenotype also includes changes of lipid levels, Wnt 

1 0 activity and/or Dkk activity. 

The terms "isolated" and "purified" refer to a substance altered by hand of 
man from the natural environment. An isolated peptide may be for example in a 
substantially pure form or otherwise displaced from its native environment such as 
by expression in an isolated cell line or transgenic animal. An isolated sequence 

15 may for example be a molecule in substantially pure form or displaced from its native 
environment such that at least one end of said isolated sequence is not contiguous 
with the sequence it would be contiguous with in nature. 

"Biologically active" refers to those forms of proteins and polypeptides, 
including conservatively substituted variants, alleles of genes encoding a protein or 

20 polypeptide fragments of proteins which retain a biological and/or immunological 
activity of the wild-type protein or polypeptide. Preferably the activity is one which 
induces a change in Dkk activity, such as inhibiting the interaction of Dkk with a 
ligand binding partner (e.g., LRP5 or LRP6 or Dkk-1 with a Dkk-1 interacting protein 
such as those shown in Figure 5). By biologically active is also meant to include any 

25 form which modulates Wnt signaling. 

By "modulate" and "regulate" is meant methods, conditions, or agents which 
increase or decrease the wild-type activity of an enzyme, inhibitor, signal transducer, 
receptor, transcription activator, co-factor, and the like. This change in activity can 
be an increase or decrease of mRNA translation, mRNA or DNA transcription, and/or 
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mRNA or protein degradation, which may in turn correspond to an increase or 
decrease in biological activity. 

By " modulated activity" is meant any activity, condition, disease or phenotype 
which is modulated by a biologically active form of a protein. Modulation may be 
5 effected by affecting the concentration or subcellular localization of biologically active 
protein, i.e., by regulating expression or degradation, or by direct agonistic or 
antagonistic effect as, for example, through inhibition, activation, binding, or release 
of substrate, modification either chemically or structurally, or by direct or indirect 
interaction which may involve additional factors. 

10 By "effective amount" or "dose effective amount" or "therapeutically effective 

amount" is meant an amount of an agent which modulates a biological activity of the 
polypeptide of the invention. 

By "immunologically active" is meant any immunoglobulin protein or fragment 
thereof which recognizes and binds to an antigen. 

1 5 By "Dkk" is meant to refer to the nucleic acids and proteins of members of the 

Dkk (Dickkopf) family. This includes, but is not limited to, Dkk-1, Dkk-2, Dkk-3, Dkk- 
4, Soggy, and related Dkk proteins. Dkk-1 is a preferred embodiment of the present 
invention. However, the Dkk proteins have substantial homology and one skilled in 
the art will appreciate that all of the embodiments of the present invention utilizing 

20 Dkk-1 may also be utilized with the other Dkk proteins. 

By "Dkk-1" is meant to refer to the Dkk-1 protein and nucleic acids which 
encode the Dkk-1 protein. Dkk-1 refers to Dickkopf-1 , and in Xenopus it is related to 
at least Dkk-2, Dkk-3, and Dkk-4 (see Krupnik et a/., Gene 238:301-313 (1999)). 
Dkk-1 was first identified in Xenopus (Glinka et a/., Nature 391 :357-62 (1998)). It 

25 was recognized as a factor capable of inducing ectopic head formation in the 

presence of inhibition of the BMP pathway. It was then also found to inhibit the axis- 
inducing activity of several Xenopus Wnt molecules by acting as an extracellular 
antagonist of Wnt signaling. Mammalian homologs have been found including Dkk- 
1, Dkk-2, Dkk-3, Dkk-4 and soggy (Fedi et a/., 1999 and Krupnick etai 1999). 

30 Human Dkk-1 was also referred to as sk (Fedi ef a/. 1999). As used herein, Dkk-1 is 
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meant to include proteins from any species having a Wnt pathway in which Dkk-1 
interacts. Particularly preferred are mammalian species (e.g., murine, caprine, 
canine, bovine, feline, equine, primate, ovine, porcine and the like), with particularly 
preferred mammals being humans. Nucleic acid sequences encoding Dkk-1 include, 
but are not limited to human Dkk-1 (GenBank Accession Nos. AH009834, 
XMJ305730, AF261 158, AF261 157, AF1 77394, AF1 27563 and NM_012242), Mus 
musculus dickkopf homolog 1 (GenBank Accession No. NM_0 10051), and Danio 
rerio dickkopf-1 (GenBank Accession Nos. AF1 16852 and AB023488). The genomic 
sequences with exon annotation are GenBank Accession Nos. AF261157 and 
AF261 158. Also contemplated are homologs of these sequences which have Dkk-1 
activity in the Wnt pathway. Dkk-1 amino acid sequences include, but are not limited 
to human dickkopf homolog 1 (GenBank Accession Nos. AAG15544, BAA34651 , 
NP_036374, AAF02674, AAD21087, and XP_005730), Danio rerio (zebrafish) 
dickkopf 1 (GenBank Accession Nos. BAA82135 and AAD22461) and murine 
dickkopf-1 (GenBank Accession Nos. 054908 and NP_034181). Variants and 
homologs of these sequences which possess Dkk-1 activity are also included when 
referring to Dkk-1 . 

By "Dkk mediated" disorder, condition or disease is any abnormal state that 
involves Dkk activity. The abnormal state can be induced by environmental 
exposure or drug administration. Alternatively, the disease or disorder can be due to 
a genetic defect. Dkk mediated diseases, disorders and conditions include but are 
not limited to bone mass disorders or conditions and lipid disorders and conditions. 
For example, bone mass disorders/conditions/diseases, which may be mediated by 
Dkk, include but are not limited to age related loss of bone, bone fractures (e.g., hip 
fracture, Colle's fracture, vertebral crush fractures), chondrodystrophies, drug- 
induced disorders (e.g., osteoporosis due to administration of glucocorticoids or 
heparin and osteomalacia due to administration of aluminum hydroxide, 
anticonvulsants, or glutethimide), high bone turnover, hypercalcemia, hyperostosis, 
osteogenesis imperfecta, osteomalacia, osteomyelitis, osteoporosis, Paget's 
disease, osteoarthritis, and rickets. 
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Lipid disorders/diseases/conditions, which may be mediated by Dkk, include 
but are not limited to familial lipoprotein lipase deficiency, familial apoprotein Cll 
deficiency, familial type 3 hyperlipoproteinemia, familial hypercholesterolemia, 
familial hypertriglyceridemia, multiple lipoprotein-type hyperlipidemia, elevated lipid 
levels due to dialysis and/or diabetes, and elevated lipid levels of unknown etiologies 

The term "recognizes and binds," when used to define interactions of 
antisense nucleotides, siRNAs (small inhibitory RNA),or shRNA (short hairpin RNA) 
with a target sequence, means that a particular antisense, siRNA, or shRNA 
sequence is substantially complementary to the target sequence, and thus will 
specifically bind to a portion of an mRNA encoding polypeptide. As such, typically 
the sequences will be highly complementary to the mRNA target sequence, and will 
have no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 base mismatches throughout the 
sequence. In many instances, it may be desirable for the sequences to be exact 
matches, i.e. be completely complementary to the sequence to which the 
oligonucleotide specifically binds, and therefore have zero mismatches along the 
complementary stretch. As such, highly complementary sequences will typically bind 
quite specifically to the target sequence region of the mRNA and will therefore be 
highly efficient in reducing, and/or even inhibiting the translation of the target mRNA 
sequence into polypeptide product. 

Substantially complementary oligonucleotide sequences will be greater than 
about 80 percent complementary (or % % exact-match') to the corresponding mRNA 
target sequence to which the oligonucleotide specifically binds, and will, more 
preferably be greater than about 85 percent complementary to the corresponding 
mRNA target sequence to which the oligonucleotide specifically binds. In certain 
aspects, as described above, it will be desirable to have even more substantially 
complementary oligonucleotide sequences for use in the practice of the invention, 
and in such instances, the oligonucleotide sequences will be greater than about 90 
percent complementary to the corresponding mRNA target sequence to which the 
oligonucleotide specifically binds, and may in certain embodiments be greater than 
about 95 percent complementary to the corresponding mRNA target sequence to 
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which the oligonucleotide specifically binds, and even up to and including 96%, 97%, 
98%, 99%, and even 100% exact match complementary to the target mRNA to 
which the designed oligonucleotide specifically binds. 

Percent similarity or percent complementary of any of the disclosed 
sequences may be determined, for example, by comparing sequence information 
using the GAP computer program, version 6.0, available from the University of 
Wisconsin Genetics Computer Group (UWGCG). The GAP program utilizes the 
alignment method of Needleman and Wunsch (1970). Briefly, the GAP program 

» 

defines similarity as the number of aligned symbols (i.e., nucleotides or amino acids) 
which are similar, divided by the total number of symbols in the shorter of the two 
sequences. The preferred default parameters for the GAP program include: (1) a 
unary comparison matrix (containing a value of 1 for identities and 0 for non- 
identities) for nucleotides, and the weighted comparison matrix of Gribskov and 
Burgess (1986), (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for 
each symbol in each gap; and (3) no penalty for end gaps. 

By "mimetic" is meant a compound or molecule that performs the same 
function or behaves similarly to the compound mimicked. 

By "reporter element" is meant a polynucleotide that encodes a poplypeptide 
capable of being detected in a screening assays. Examples of polypeptides 
encoded by reporter elements include, but are not limited to, lacZ, GFP, luciferase, 
and chloramphenicol acetyltransferase. 

2. Introduction 

A polymorphism in LRP5 (Zmax), G171V, designated as HBM, has been 
identified as conferring a high bone mass phenotype in a population of related 
subjects as described in co-pending applications International Patent Application 
PCT/US 00/16951, and U.S. Patent Application Nos. 09/543,771 and 09/544,398, 
which are hereby incorporated by reference in their entirety (Little et aL, Am J Hum 
Genet 70:1 1-19 (2002)). LRP5 is also described in International Patent Application 
WO 98/46743, which is incorporated by reference in its entirety. Loss of LRP5 
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function has been shown to have a deleterious effect on bone (Gong etaL, Cell 
107:513-523 (2001)). Additionally, the HBM polymorphism and LRP5 may also be 
important in cardiac health and lipid-mediated disorders. Thus, methods of 
regulating their activity can serve as methods of treating and/or preventing cardiac 
and lipid-mediated disorders. 

Recent studies have indicated that LRP5 participates in the Wnt signal 
transduction pathway. The Wnt pathway is critical in limb early embryological 
development. A recently published sketch of the components of Wnt signaling is 
shown in Figure 1 

(Nusse, 2001 http://www.stanford.edu/-rnusse/pathways/cell2.html) (see also, 
Nusse, Nature 41 1 :255-6 (2001 ); and Mao et a/., Nature 41 1 :321-5 (2001 )). Briefly 
summarized, Wnt proteins are secreted proteins which interact with the 
transmembrane protein Frizzled (Fz). LRP proteins, such as LRP5 and LRP6, are 
believed to modulate the Wnt signal in a complex with Fz (Tamai et a/., Nature 
407:530-5 (2000)). The Wnt pathway acts intracellular^ through the Disheveled 
protein (Dsh) which in turn inhibits glycogen synthetase kinase-3 (GSK3) from 
phosphorylating (J-catenin. Phosphorylated (3-catenin is rapidly degraded following 
ubiquitination. However, the stabilized p-catenin accumulates and translocates to 
the nucleus where it acts as a cofactor of the T-cell factor (TCF) transcription 
activator complex. 

The protein dickkopf-1 (Dkk-1 ) is reported to be an antagonist of Wnt 
pathway. Dkk-1 is required for head formation in early development. Dkk-1 and its 
function in the Wnt pathway are described in e.g., Krupnik, et a/., Gene 238:301-13 
(1999); Fedi et a/., J. Biol. Chem. 274:19465-72 (1999); see also for Dkk-1 and the 
Wnt pathway, Wu et a/., Cum Biol. 1 0:161 1-4 (2000), Shinya et a/., Mech. Dev. 98:3- 
17 (2000), Mukhopadhyay et al, Dev Cell 1 :423-434 (2001 ) and in PCT Patent 
Application No. WO 00/52047, and in references cited in each. It has been known 
that Dkk-1 acts upstream of Dsh, however the nature of the mechanism of inhibition 
by Dkk-1 is just beginning to be elucidated. Dkk-1 is expressed in the mouse 
embryonic limb bud and its disruption results in abnormal limb morphogensis, among 
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other developmental defects (Gotewold era/., Mech. Dev. 89:151-3 (1999); and, 
Mukhopadhyay et al., Dev Cell 1 :423-434 (2001 )). 

Related U.S. provisional application 60/291,311 disclosed a novel interaction 
between Dkk-1 (GenBank Accession No. XM 005730) and LRP5. The interaction 
between Dkk-1 and LRP5 was discovered by a yeast two hybrid (Y2H) screen for 

* 

proteins which interact with the ligand binding domain of LRP5, as described in 

» 

Example 1. The two-hybrid screen is a common procedure in the art, which is 
described, for example, by Gietz etaL, Mol. Cell. Biochem. 172:67-79 (1997); 
Young, Biol. Reprod. 58:302-11 (1998); Brent and Finley, Ann. Rev. Genet 31:663- 
704 (1997); and Lu and Hannon, eds., Yeast Hybrid Technologies . Eaton Publishing, 
Natick MA, (2000). More recently, other studies confirm that Dkk-1 is a binding 
partner for LRP and modulates the Wnt pathway via direct binding with LRP (R. 
Nusse, Nature 41 1 :255-256 (2001 ); A. Bafico et a/., Nat. Cell Biol. 3:683-686 (2001 ); 
M. Semenov, Curr. Biol. 11:951-961 (2001); B. Mao, Nature 411:321-325 (2001), 
Zorn, Curr. Biol. 11:R592-5 (2001)); and, L. Li etal. f J. Biol Chem. 277:5977-81 
(2002)). 

Mao and colleagues (2001) identified Dkk-1 as a ligand for LRP6. Mao et al. 
suggest that Dkk-1 and LRP6 interact antagonistically where Dkk proteins inhibit the 
Wnt coreceptor functions of LRP6. Using co-immunoprecipitation, the group verified 
that the Dkk-1 /LRP6 interaction was direct. Dkk-2 was also found to directly bind 
LRP6. Contrary to data contained in provisional application 60/291 ,31 1 , Mao et al. 
report that no interaction was detected between any Dkk protein and LRP5, as well 
as no interaction with LDLR, VLDLR, ApoER, or LRP). Additionally, Mao et al. 
demonstrated that LRP6 can titrate Dkk-1's effects of inhibiting Wnt signaling using 
the commercial TCF-luciferase reporter gene assay (TOPFLASH). A similar 
conclusion was drawn from analogous studies in Xenopus embryos. Deletion 
analyses of LRP6 functional domains revealed that EGF repeats (beta-propellers) 3 
and 4 were necessary for Dkk-1 binding and that the ligand binding domains of LRP6 
had no effect on Dkk-1 binding. The findings of Mao et al. contrast with data 

■ 

obtained by the present inventors indication that the ligand binding domains of LRP5 
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were necessary and sufficient for Dkk-1 binding in yeast. Using classical 
biochemical ligand-receptor studies, Mao et al determined a Kd=0.34 nM for Dkk- 
1/LRP6 and a Kd=0.73 nM for Dkk-2/LRP6. 

Semenov et al (2001) verified the Mao group's results and confirmed by 
coimmunoprecipitation that Dkk-1 does not directly bind to Wnt or Frizzled but rather 
interacts with LRP6. Their Scatchard analyses found a Kd=0.5 nM for Dkk-1/LRP6. 
Semenov et al also demonstrated that Dkk-1 could abolish an LRP5/Frizzled8 
complex implying that Dkk-1 can also repress Wnt signaling via interactions with 
LRP5. A Dkk-1 mutant where cysteine 220 was changed to alanine abolished LRP6 
binding and was unable to repress Wnt signaling. Studies in Xenopus embryos 
confirmed the results and revealed a functional consequence of Dkk-1 /LRP6: 
repression of Wnt signaling. Their Xenopus work also suggested that LRP6/Dkk-1 
may be specific for the canonical, (3-catenin-mediated, Wnt pathways as opposed to 
the Wnt Planar Cell Polarity pathway. 

Bafico et al (2001) employed a 125 l-labeled Dkk-1 molecule to identify LRP6 
as its sole membrane receptor with a Kd=0.39 nM. Again, the functional 
consequences of the Dkk-1 /LRP6 interaction was a repression of the canonical Wnt 
signaling even when Dkk-1 was added at extremely low concentrations (30 pM). 

Not wishing to be bound by theory, it is believed that the present invention 
provides an explanation for the mechanism of Dkk-1 inhibition of the Wnt pathway 
and provides a mechanism whereby the Wnt pathway may be modulated. The 
present application and related provisional application 60/291,311 describe Dkk- 
1/LRP5 interactions and demonstrate that the interaction between LRP5/LRP6/HBM 
and Dkk can be used in a method as an intervention point in the Wnt pathway for an 
anabolic bone therapeutic or a modulator of lipid metabolism. 

As detailed below, in the section "Methods to Identify Binding Partners" and 
Examples 6 and 7, Dkk-1 is able to repress LRP5-mediated Wnt signaling but not 
HBM-mediated Wnt signaling. This observation is of particular interest because the 
HBM mutation in LRP5 is a gain of function or activation mutation. That is, Wnt 
signaling, via the canonical pathway, is enhanced with HBM versus LRP5. The 
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present data suggest the mechanism of this functional activation: the inability of Dkk- 
1 to repress HBM-mediated Wnt signaling. Further investigations of other Wnt or 
Dkk family members show differential activities in the canonical Wnt pathway that 
demonstrate the complexity and variability in Wnt signaling that can be achieved 
5 depending on the LRP/Dkk/Wnt/Frizzled repertoire that is expressed in a particular 
cell or tissue. This may attest to the apparent bone specificity of the HBM phenotype 
in humans and in the HBM transgenic animals. 

Furthermore, the present data reveal the importance and functional 
consequence for the potential structural perturbation of the first beta-propeller 

1 0 domain of LRP5. Our data identified the ligand binding domain of LRP5 as the 
interacting region with Dkk-1 while the Mao ef a/, publication demonstrated the 
functional role of propellers 3 and 4 in their LRP6/Dkk-1 studies. In the present 
invention, we implicate the first beta propeller domain, via the HBM mutation at 
residue 171, as having a functional consequence in the Dkk-1 -mediated Wnt 

15 pathway. The involvement of position 171 of propeller 1 may be direct or indirect 
with Dkk-1 . Direct involvement could arise from perturbations of the 3-dimensional 
structure of the HBM extracellular domain that render Dkk-1 unable to bind. 
Alternatively, residue 171 of propeller 1 may directly interact with Dkk-1; however, by 
itself, it is insufficient to bind and requires other LRP5 domains. Potential indirect 

20 candidate molecules may be among the proteins identified the Dkk-1 yeast-two- 
hybrid experiments. 

It may be that the disruption of Dkk activity is not necessarily mediated by 
enhancing or preventing the binding of Dkk to LRP5/LRP6/HBM. More than one 
mechanism may be involved. Indeed, the inventors have observed that Dkk-1 binds 

25 LRP5, LRP6, and HBM. It is able to effectively inhibit LRP6, and to a slightly lesser 
extent, LRP5 activity. Further, has been observed that different members of the Dkk 
family differentially affect LRP5/LRP6/HBM activity. For example, Dkk-1 inhibits 
LRP5/LRP6/HBM activity while another Dkk may enhance LRP5/LRP6/HBM activity. 
An endpoint to consider is the modulation of the LRP5/LRP6/HBM activity, not simply 

30 binding. 
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The present disclosure shows that targeting the disruption of the Dkk-1/LRP5 
interaction is a therapeutic intervention point for an HBM mimetic agent. A 
therapeutic agent of the invention may be a small molecule, peptide or nucleic acid 
aptamer, antibody, or other peptide/protein, etc. Methods of reducing Dkk-1 
expression may also be therapeutic using methodologies such as: RNA interference, 
antisense oligonucleotides, morpholino oligonucleotides, PNAs, antibodies to Dkk-1 
or Dkk-1 interacting proteins, decoy or scavenger LRP5 or LRP6 receptors, and 
knockdown of Dkk-1 or Dkk-1 interactor transcription. 

In an embodiment of the present invention, the activity of Dkk-1 or the activity 
of a Dkk-1 interacting protein may be modulated for example by binding with a 
peptide aptamer of the present invention. In another embodiment, LRP5 activity 
may be modulated by a reagent provided by the present invention (e.g., a peptide 
aptamer). In another embodiment, the Dkk-1 /LRP5 interaction may be modulated by 
a reagent of the present invention (e.g., a Dkk-1 interacting protein such as those 
identified in Figure 5). In another embodiment, the Wnt signal transduction pathway 
may be modulated by use of one or more of the above methods. In a preferred 
embodiment of the present invention, the Dkk-1 mediated activity of the Wnt 
pathway may be specifically modulated by one or more of the above methods. In 
another preferred embodiment of the present invention, the Wnt signal transduction 
pathway may be stimulated by down-regulating Dkk-1 interacting protein activity; 

» 

such down-regulation could, for example, yield greater LRP5 activity. In a more 
preferred embodiment, by stimulating LRP5 activity, bone mass regulation may be 
stimulated to restore or maintain a more optimal level. In another preferred 
embodiment, by stimulating LRP5 activity, lipid metabolism may be stimulated to 
restore or maintain a more optimal level. Alternative embodiments provide methods 
for screening candidate drugs and therapies directed to correction of bone mass 
disorders or lipid metabolism disorders. And, preferred embodiments of the present 
invention provide drugs and therapies developed by the use of the reagents and/or 
methods of the present invention. One skilled in the art will understand that the 
present invention provides important research tools to develop an effective model of 
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osteoporosis, to increase understanding of bone mass and lipid modulation, and to 
modulate bone mass and lipid metabolism. 

Previous investigation of a large family in which high bone mass is inherited 
as a single gene (autosomal dominant) trait (HBM-1) has provided important insight 
into the mechanism by which bone density might be modulated. Members of this 
family have significantly increased spinal and hip BMD (>3 standard deviations 
above the norm) which affects young adults as well as elderly family members into 
the ninth decade. The bones of affected members, while appearing very dense 
radiographically, have normal external shape and outer dimensions. Cortical bone is 
thickened on endosteal surfaces and "affected" individuals are asymptomatic without 
any other phenotypic abnormalities. Assays of biochemical markers that reflect 
skeletal turnover suggest that the disorder is associated with a normal rate of bone 
remodeling. Affected individuals have achieved a balance in bone turnover at a 
density that is significantly greater than necessary for normal skeletal stresses. 
Importantly, the bones most affected are load-bearing bones which are subjected to 
the greatest mechanical and gravitational stresses (spine and hip). These are the 
most important bones to target fir therapeutic interventions in osteoporosis. The 
gene identified as being responsible for this phenotype, Zmax or LRP5, was not 
previously associated with bone physiology. The fact that modification of this gene, 
such as that produced by the polymorphism leading to the autosomal dominant 
inheritance of the HBM family phenotype, identifies Zmax/LRP5 and the pathway by 
which it is regulated, including Dkk/Wnt pathways discussed above, as an important 
target for developing modulators of bone density. Modulation of Zmax/LRP5 to 
mimic the gain in function provided by the HBM polymorphism would be expected to 
provide an important therapy for bone wasting conditions. Additionally, such 
modulation in young adults could enhance peak bone mass and prevent or delay 
fracture risk later in life. Alternatively, modulation to reduce function could be 
employed to treat conditions where bone is being inappropriately produced. 
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3. Pol ypeptides 

Polypeptides contemplated for use in this invention include those which 
modulate Dkk and Dkk interacting protein activities. Preferred polypeptides and 
peptides include those which modulate the Wnt pathway. Examples of preferred 
sequences include the Y2H baits exemplified in Figure 2, peptide aptamers of Figure 
3 (SEQ ID NOs:171-188) and Figure 4 (SEQ ID NOs:189-192), the polypeptides of 
the Dkk-1 interacting proteins identified in Figure 5, those polypeptides shown in 
Figure 6, the LRP binding domain of Dkk (amino acids 138-266 of hDkkl ), the 



cysteine-rich domain 2 (a.a. 183-245 of hDkk-1), the cysteine-rich domain 1 (a.a. 97- 
138 of hDkk), and LRP5 binding aptamers of Figure 13 (including SEQ ID NOs:204- 
213). Although Dkk-1 is exemplified, the other Dkk proteins contain substantially 
similar regions and may also be used according to the present invention. 

For example, the baits depicted in Figure 2 were used in a yeast two hybrid 
(Y2H) screen. The Y2H screen was performed as described in Example 2 to 
determine the minimum required binding domain for Dkk-1 to bind LRP5. The 
minimum binding domain constructs {i.e., residues 139-266 in bold below and 
residues 97-245 which are underlined, of Dkk-1) include the second cysteine rich 
domain which has sequence homology to a colipase fold. 

mmalgaagat rvfvamvaaa Igghpllgvs atlnsvlnsn aiknlppplg gaaghpgsav 60 
saapgilypg gnkyqtidny qpypcaedee ccrtde ycasn trcradacrvqi clacrkrrkr 120 
cmrhamccpg nvckncricva sdonhfrcrei eetiteafcrn dhatldoyar rttlsskmyh 180 
tkcraecrsvcl rsadcasalc carhfwakic kpvlkeaovc tkhrrkashcr leifarcyca 240 

egMcriqkd hhqasnssri htcqrh (Gen Bank Accession No. XP_005730) (SEQ ID 
NO:128). 

This homology suggests a lipid-binding function and may facilitate Dkk-1 interactions 
at the plasma membrane (van Tilbeurgh, H., Biochim. Biophys. Acta. 1441:173-84 
(1999)). An interaction domain of Dkk-1 that is able to interact with the ligand 
binding domain (LBD) of LRP5 is a useful reagent in the modulation of LRP5 activity 
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and modulation of Dkk-1/LRP5 complex formation. Similar screens can be prepared 
for Dkk-1 and Dkk-1 interacting proteins or polypeptides. 

A set of peptide aptamers was identified from a library of random peptides 
constrained and presented in a thioredoxin A (trxA) scaffold as described in Example 
5 3. Peptide aptamers are powerful new tools for molecular medicine as reviewed by 
Hoppe-Seyler & Butz, J. Mol. Med., 78:426-430 (2000); Brody and Gold, Rev. Mol. 
Biotech., 74:5-13 (2000); and Colas, Curr. Opin. in Chem. Biol. 4:54-9 (2000) and 
the references cited therein. Briefly, peptide aptamers have been shown to be highly 
specific reagents capable of binding in vivo. As such, peptide aptamers provide a 

10 method of modulating the function of a protein and may serve as a substitute for 

conventional knock-out methods, knock-down or complete loss of function. Peptide 
aptamers are also useful reagents for the validation of targets for drug development 
and may be used as therapeutic compounds directly or provide the necessary 
foundation for drug design. Once identified, the peptide insert may be synthesized 

15 and used directly or incorporated into another carrier molecule. References 
reviewed and cited by Brody and Gold (2000, supra) describe demonstrated 
therapeutic and diagnostic applications of peptide aptamers and would be known to 
the skilled artisan. 

The peptide aptamers of the present invention are useful reagents in the 
20 binding of Dkk-1 to its ligands and thereby modulation of the Wnt pathway and may 
be used to prevent Dkk-1 from inhibiting LRP5 modulation or Dkk-1 interacting 
protein modulation of the Wnt pathway. The sequence of these peptide aptamers is 
shown in Figure 3 (SEQ ID NOs:171-188). The peptide aptamers refers to the 
peptide constrained by the thioredoxin scaffold. The aptamers are also 
25 contemplated as therapeutic agents to treat Dkk-1 mediated diseases and 

conditions. Such aptamers are useful structural guides to chemists, for the design of 
mimetic compounds of the aptamers. 

Peptide aptamers were likewise developed to the LRP5 ligand binding domain 
(LBD) bait sequences. The sequences of these peptide aptamers is shown in Figure 
30 4 (SEQ ID NOs:189-192). These are useful reagents which may be used to disrupt 
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the Dkk-1/LRP5 binding interface while leaving Dkk-1 undisturbed. These can be 
used as comparative controls for Wnt signaling, thus, a control is provided for the 
specificity of any drug or therapy screened. The aptamers are also useful 
therapeutic agents to treat LRP mediated diseases and conditions. Such aptamers 
may also be used as structural guides to chemists, for the design of mimetic 
compounds of the aptamers. 

Thirty proteins were identified which interact with Dkk-1, Dkk-1 interacting 
proteins, were identified in a yeast-two-hybrid screen using the Dkk-1 bait and are 
shown in Figure 5. It was noted that these results suggest an interaction of Dkk-1 
with Notch-2. It has been suggested that cross-talk exists between the Wnt and 
Notch signaling pathways. For instance, Presenilinl (Ps1) is required for Notch 
processing and inhibits the downstream Wnt pathway. The extracellular domain of 
Notch is thought to interact with Wnt. Furthermore, the Notch intracellular domain is 
thought to interact with disheveled and in signal induced processing, the intracellular 
domain is thought to interact with presenilin. (Soriano et a/., J. Cell Biol 152:785-94 
(2001)). For additional information regarding the relationships between Notch and 
Wnt signaling, see Wesley, Mol. Cell. Biol. 19:5743-58 (1999) and Axelrod et al. t 
Science 271 : 1826-32 (1 996). 

An interaction between Dkk-1 and chordin has also been noted; suggesting 
that cross-talk exists between the Wnt and TGF-beta/BMP signaling pathways 
(Letamendia et al. t J. Bone Joint Surg. Am. 83A:S31 (2001); Labbe etal, Proc. Natl. 
Acad. Sci. USA 97:8358-63 (2000); Nishita etal., Nature 403:781-5 (2000); 
DeRobertis ef a/., Int. J. Dev. Biol. 45:1389-97 (2001); and Saint-Jeannet et al., Proc. 
Natl. Acad. Sci. USA 94:13713-8 (1997)). The BMP signaling pathway has an 
established role in bone and connective tissue development, repair and homeostasis 
(review in Rosen and Wozney "Bone Morphogenetic Proteins" In: Principles of Bone 
Biology, 2 nd Edition, Eds. J. Bilezikian, L. Raisz and G. Rodan, Academic Press, pp. 
919-28 (2002)). Chordin is an important molecule during development which also 
modulates BMP signaling in adults by sequestering BMPs in latent complexes 
(Piccolo et al., Cell 86:589-98 (1996) reviewed in Reddi, Arthritis Res. 3:1-5 (2001); 
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DeRobertis et a/., Int. J. Dev. Biol. 45:189-97 (2001)). It may be that Dkk effects 
bone mass modulation through both the Wnt signaling pathway via LRP and the 
BMP pathway via chordin. 

Moreover, a number of putative growth factors, growth factor related proteins, 
and extracellular matrix proteins have been identified as Dkk-1 interacting proteins. 
Additional information regarding Dkk-1 interacting proteins identified in the Y2H 
assay may be obtained from publicly available databases such as PubMed via the 
use of the accession numbers provided in the present application. In a preferred 
embodiment of the invention, the amino acid sequences of these Dkk-1 interacting 
proteins or biologically active fragments thereof be used to modulate Dkk, Dkk-1 , 
LRP5, LRP6, HBM, or Wnt activity. Although these proteins were identified as 
interacting with Dkk-1 , due to the substantial homology between the various Dkk 
proteins, such interacting proteins are contemplated to interact with the other Dkk 
family members. 

4. Aptamer Mimetics 

The present invention further provides for mimetics of Dkk, particularly Dkk-1, 
and LRP5 peptide aptamers. Such aptamers may serve as structural guides to 
chemists for the design of mimetic compounds of the aptamers. The aptamers and 
their mimetics are useful as therapeutic agents to treat LRP- or Dkk-mediated 
diseases and conditions. 

5, Nucleic Acid Molecules 

The present invention further provides nucleic acid molecules that encode 
polypeptides and proteins which interact with Dkk and Dkk interacting proteins, 
and/or LRP5 (also LRP6 and HBM) to modulate biological activities of these 
proteins. Preferred embodiments provide nucleic acids encoding for fragments of 
Dkk-1 protein, including the nucleic acids of Figure 7, the Dkk-1 interacting proteins 
listed in Figure 5, polypeptide aptamers of Dkk-1 (Figure 3 - SEQ ID NOs:171-188), 
LRP5 (Figure 4 - SEQ ID NOs:189-192), Figure 13 peptide aptamers (including SEQ 
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ID NO:204-214) encoded by Figure 12 polynucleotides (including SEQ ID NO: 193- 
203), LRP6 and HBM and the related fusion proteins herein described, preferably in 
isolated or purified form. As used herein, "nucleic acid" is defined as RNA, DNA, or 
cDNA that encodes a peptide as defined above, or is complementary to a nucleic 
5 acid sequence encoding such peptides, or hybridizes to either the sense or 
antisense strands of the nucleic acid and remains stably bound to it under 
appropriate stringency conditions. The nucleic acid may encode a polypeptide 
sharing at least about 75% sequence identity, preferably at least about 80%, and 
more preferably at least about 85%, with the peptide sequences; at least about 90%, 

10 95%, 96%, 97%, 98%, and 99% or greater are also contemplated. Specifically 

contemplated are genomic DNA, cDNA, mRNA, antisense molecules, enzymatically 
active nucleic acids (e.g., ribozymes), as well as nucleic acids based on an 
alternative backbone or including alternative bases, whether derived from natural 
sources or synthesized. Such hybridizing or complementary nucleic acids, however, 

15 are defined further as being novel and nonobvious over any prior art nucleic acid 

including that which encodes, hybridizes under appropriate stringency conditions, or 
is complementary to a nucleic acid encoding a protein according to the present 
invention. 

As used herein, the terms "hybridization" (hybridizing) and "specificity" 
20 (specific for) in the context of nucleotide sequences are used interchangeably. The 
ability of two nucleotide sequences to hybridize to each other is based upon the 
degree of complementarity of the two nucleotide sequences, which in turn is based 
on the fraction of matched complementary nucleotide pairs. The more nucleotides in 
a given sequence that are complementary to another sequence, the greater the 
25 degree of hybridization of one to the other. The degree of hybridization also 

■ 

depends on the conditions of stringency which include temperature, solvent ratios, 
salt concentrations, and the like. In particular, "selective hybridization" pertains to 
conditions in which the degree of hybridization of a polynucleotide of the invention to 
its target would require complete or nearly complete complementarity. The 
30 complementarity must be sufficiently high so as to assure that the polynucleotide of 
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the invention will bind specifically to the target nucleotide sequence relative to the 
binding of other nucleic acids present in the hybridization medium. With selective 
hybridization, complementarity will be about 90-100%, preferably about 95-100%, 
more preferably about 100%. 

"Stringent conditions" are those that (1) employ low ionic strength and high 
temperature for washing, for example: 0.015 M NaCI, 0.0015 M sodium titrate, 0.1% 
SDS at 50°C; or (2) employ during hybridization a denaturing agent such as 
formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum albumin, 
0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer at pH 6.5 
with 750 mM NaCI, 75 mM sodium citrate at 42°C. Another example is use of 50% 
formamide, 5X SSC (0.75 M NaCI, 0.075 M sodium citrate), 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5X Denhardt's solution, sonicated 
salmon sperm DNA (50 pg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with 
washes at 42°C in 0.2X SSC and 0.1% SDS. A skilled artisan can readily determine 
and vary the stringency conditions appropriately to obtain a clear and detectable 
hybridization signal. 

As used herein, a nucleic acid molecule is said to be "isolated" or "purified" 
when the nucleic acid molecule is substantially separated from contaminant nucleic 
acid encoding other polypeptides from the source of nucleic acid. Isolated or purified 
is also meant to include nucleic acids which encode Dkk or fragments thereof which 
lack surrounding genomic sequences that flank the Dkk gene. Isolated or purified is 
further intended to include nucleic acids which encode Dkk interacting proteins or 
biologically active fragments thereof which lack surrounding genomic sequences that 
flank the Dkk interacting protein genes. 

The present invention further provides fragments of the encoding nucleic acid 
molecule. As used herein, a fragment of an encoding nucleic acid molecule refers to 
a small portion of the entire protein encoding sequence. The size of the fragment 
will be determined by the intended use. For example, if the fragment is chosen so as 
to encode an active portion of the protein, the fragment will need to be large enough 
to encode the functional region(s) of the protein. If the fragment is to be used as a 



63 



WO 02/092015 



PCTYUS02/15982 



nucleic acid probe or PCR primer, then the fragment length is chosen so as to obtain 
a relatively small number of false positives during probing/priming. 

Fragments of the encoding nucleic acid molecules of the present invention 
(/.e. f synthetic oligonucleotides) that are used as probes or specific primers for the 
polymerase chain reaction (PCR), or to synthesize gene sequences encoding 
proteins of the invention can easily be synthesized by chemical techniques, for 
example, the phosphotriester method of Matteucci et al. (J. Am. Chem. Soc. 
103:3185-3191 (1981)) or using automated synthesis methods. In addition, larger 
DNA segments can readily be prepared by well known methods, such as synthesis 
of a group of oligonucleotides that define various modular segments of the gene, 
followed by ligation of oligonucleotides to build the complete modified gene. 

The polypeptide encoding nucleic acid molecules of the present invention may 
further be modified to contain a detectable label for diagnostic and probe purposes. 
A variety of such labels are known in the art and can readily be employed with the 
encoding molecules herein described. Suitable labels include, but are not limited to, 
biotin, radiolabeled nucleotides and the like. A skilled artisan can employ any of the 
art known labels to obtain a labeled encoding nucleic acid molecule. 

Modifications to the primary structure itself by deletion, addition, or alteration 
of the amino acids incorporated into the protein sequence during translation can be 
made without destroying the activity of the protein. Such substitutions or other 
alterations result in proteins having an amino acid sequence encoded by a nucleic 
acid falling within the contemplated scope of the present invention. 

Antisense molecules corresponding to the polypeptide coding or 
complementary sequence may be prepared. Methods of making antisense 
molecules which bind to mRNA, form triple helices or are enzymatically active and 
cleave TSG RNA and single stranded DNA (ssDNA) are known in the art. See, e.g., 
Antisense and Ribozyme Methodologv:Laboratorv Companion (Ian Gibson, ed., 
Chapman & Hall, 1997) and Ribozvme Protocols: Methods in Molecular Biology 
(Phillip C. Turner, ed., Humana Press, Clifton, NJ, 1997). 
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Also contemplated is the use of compounds which mediate postranscriptional 
gene silencing (PTGS), quelling and RNA interference (RNAi). These compounds 
typically are about 21 to about 25 nucleotides and are also known as short interfering 
RNAs or short inhibitory RNAs (siRNAs). The siRNAs are produced from an 
initiating double stranded RNA (dsRNA). Although the full mechanism by which the 
siRNAs function is not fully elucidated, it is known that these siRNAs transform the 
target mRNA into dsRNA, which is then degraded. Preferred forms are 5' 
phosphorylated siRNAs, however, hydroxylated forms may also be utilized. For 
additional background regarding the preparation and mechanism of siRNAs 
generally, see, e.g., Ljpardi era/., Cell 107(3): 297-307 (2001); Boutla etal., Curr. 
Biol. 11(22): 1776-80 (2001); Djikeng etal., RNA 7(11): 1522-30 (2001); Elbashir et 
a/., EMBO J. 20(23): 6877-88 (2001); Harborth etal., J. Cell. Sci. 1 14(Pt. 24): 4557- 
65 (2001); Hutvagner etal., Science 293(5531): 811-3 (2001); and Elbashir etal.. 
Nature 41 1:494-98 (2001). 

Also contemplated are short hairpin RNAs (shRNAs). shRNAs are a 
modification of the siRNA method described above. Instead of transfecting 
exogenously synthesized dsRNA into a cell, sequence-specific silencing can be 
achieved by stabling expressing siRNA from a DNA template as a fold-back stem- 
loop, or hairpin. This approach is known as shRNA. This method permits the 
analysis of loss of function phenotypes due to sequence-specific gene silencing in 
mammalian cells by avoiding many of the problems associated with siRNAs, such as 
RNase degradation of the reagents, expensive chemical synthesis, etc. For 
additional background regarding the preparation and mechanism of shRNAs 
generally, see, e.g., Yu et a/., PNAS 99:6047-6052 (2002); Paddison et a/., Genes 
andDevel. 16:948-58 (2002); and Brummelkamp et a/., Science 296:550-553 (2002). 
For additional background on the use of this method in mammalian gene knockdown 
methodologies, see Tuschl, Nature Biotech. 20:446-448 (2002) (and references 
therein). 

In one preferred embodiment, the siRNA or shRNA is directed to a Dkk 
encoding mRNA, wherein a preferred Dkk is Dkk-1. In another embodiment, the 
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siRNA or shRNA is directed towards a protein which binds to and modulates the 
activity of or is modulated by a Dkk; these proteins include LRP5, LRP6 and HBM as 
well as other members of the Wnt pathway. 

5 6. Isolation of Other Related Nucleic Acid Molecules 

The identification of the nucleic acid molecule of Dkk allows a skilled artisan to 
isolate nucleic acid molecules that encode other members of the Dkk family (see, 
Krupnik et al., 1999). Further, the presently disclosed nucleic acid molecules allow 
a skilled artisan to isolate nucleic acid molecules that encode Dkk-1 -like proteins, in 

10 addition to Dkk-1 . The presently disclosed Dkk-1 interacting proteins and their 

corresponding nucleic acid molecules allows a skilled artisan to further isolate other 
related protein family members which interact with Dkk-1 . 

A skilled artisan can readily use the amino acid sequence of Dkk and Dkk 
interacting proteins to generate antibody probes to screen expression libraries 

15 prepared from appropriate cells. Typically, polyclonal antiserum from mammals such 
as rabbits immunized with the purified protein (as described below) or monoclonal 
antibodies can be used to probe a mammalian cDNA or genomic expression library, 
such as a human macrophage library, to obtain the appropriate coding sequence for 
other members of the protein family. The cloned cDNA sequence can be expressed as 

20 a fusion protein, expressed directly using its own control sequences, or expressed by 
constructions using control sequences appropriate to the particular host used for 
expression of the desired protein. 

Alternatively, a portion of the coding sequence herein described can be 
synthesized and used as a probe to retrieve DNA encoding a member of the protein 

25 family from any mammalian organism. Oligomers containing approximately 18-20 

nucleotides (encoding about a 6-7 amino acid stretch) are prepared and used to screen 
genomic DNA or cDNA libraries to obtain hybridization under stringent conditions or 
conditions of sufficient stringency to eliminate an undue level of false positives. 
Additionally, pairs of oligonucleotide primers can be prepared for use in a 

30 polymerase chain reaction (PCR) to selectively clone an encoding nucleic acid 
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molecule. A PCR denature/anneal/extend cycle for using such PCR primers is well 
known in the art and can readily be adapted for use in isolating other encoding nucleic 
acid molecules. For example, degenerate primers can be utilized to obtain sequences 
related to Dkk-1 or Dkk-1 interacting proteins. Primers can be designed that are not 
5 perfectly complementary and can still hybridize to a portion of a target sequence or 
flanking sequence and thereby provide for amplification of all or a portion of a target 
sequence. Primers of about 20 nucleotides or less, preferably have about one to three 
mismatches located at the 5' and/or 3' ends. Primers of about 20 to 30 nucleotides 
have up to about 30% mismatches and can still hybridize to a target sequence. 

10 Hybridization conditions for primers with mismatch can be determined by the method 
described in Maniatis et a/., Molecular Cloning: A Laboratory Manual (Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY, 1982) or by reference to known methods. 
The ability of the primer to hybridize to a sequence of either Dkk-1 , a Dkk-1 interacting 
protein, or a related sequence under varying conditions can be determined using this 

1 5 method. Because a target sequence is known, the effect of mismatches can be 

determined by methods known to those of skill in the art. Degenerate primers would be 
based on putative conserved amino acid sequences of the Dkk-1 and Dkk-1 interacting 
protein genes. 

20 7. rDNA Molecules for Polypeptide Expression 

The present invention further provides recombinant DNA molecules (rDNAs) that 
contain a polypeptide coding sequence. As used herein, a rDNA molecule is a DNA 
molecule that has been subjected to molecular manipulation in situ. Methods for 
generating rDNA molecules are well known in the art, for example, see Sambrook et 

25 a/., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, Cold 

Spring Harbor, NY, 1989). In the preferred rDNA molecules, a coding DNA sequence 
is operably linked to expression control sequences and/or vector sequences. 

The choice of vector and/or expression control sequences to which one of the 
protein family encoding sequences of the present invention is operably linked depends 

30 directly, as is well known in the art, on the functional properties desired, e.g., protein 
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expression, and the host cell to be transformed. A vector contemplated by the present 
invention is at least capable of directing the replication and/or insertion into the host 
chromosome, and preferably also expression, of the structural gene included in the 
rDNA molecule. 

Expression control elements that are used for regulating the expression of an 
operably linked protein encoding sequence are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, and other 
regulatory elements. Preferably, the inducible promoter is readily controlled, such as 
being responsive to a nutrient in the host ceirs medium. Preferred promoters include 
yeast promoters, which include promoter regions for metallothionein, 3- 
phosphoglycerate kinase or other glycolytic enzymes such as enolase or 
glyceraldehyde-3-phosphate dehydrogenase, enzymes responsible for maltose and 
galactose utilization, and others. Vectors and promoters suitable for use in yeast 
expression are further described in EP 73,675A. Appropriate non-native mammalian 
promoters might include the early and late promoters from SV40 (Fiers et al, Nature, 
273:113 (1978)) or promoters derived from Moloney murine leukemia virus, mouse 
tumor virus, avian sarcoma viruses, adenovirus II, bovine papilloma virus or polyoma. 
In addition, the construct may be joined to an amplifiable gene (e.g., DHFR) so that 
multiple copies of the gene may be made. For appropriate enhancer and other 
expression control sequences, see also Enhancers and Eukarvotic Gene Expression 
(Cold Spring Harbor Press, Cold Spring Harbor, NY, 1983). Preferred bone related 
promoters include CMVbActin or type I collagen promoters to drive expression of the 
human HBM, Zmax1/LRP5 or LRP6 cDNA. Other preferred promoters for mammalian 
expression are from cytomegalovirus (CMV), Rous sarcoma virus (RSV), Simian virus 
40 (SV40), and EF-1a (human elongation factor 1a-subunit). 

In one embodiment, the vector containing a coding nucleic acid molecule will 
include a prokaryotic replicon, /.e., a DNA sequence having the ability to direct 
autonomous replication and maintenance of the recombinant DNA molecule 
extrachromosomally in a prokaryotic host cell, such as a bacterial host cell, transformed 
therewith. Such replicons are well known in the art. In addition, vectors with a 
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prokaryotic replicon may also include a gene whose expression confers a detectable 
marker such as a drug resistance. Typical bacterial drug resistance genes are those 
that confer resistance to ampicillin or tetracycline. 

Vectors that include a prokaryotic replicon can further include a prokaryotic or 
bacteriophage promoter capable of directing the expression (transcription and 
translation) of the coding gene sequences in a bacterial host cell, such as E. ooli A 
promoter is an expression control element formed by a DNA sequence that permits 
binding of RNA polymerase and transcription to occur. Promoter sequences 
compatible with bacterial hosts are typically provided in plasmid vectors containing 
convenient restriction sites for insertion of a DNA segment of the present invention. 
Typical of such vector plasmids are pUC8, pUC9, pBR322 and pBR329 available from 
Biorad Laboratories, (Richmond, CA), and pPL and pKK223 available from Pharmacia 
(Piscataway, NJ). 

Expression vectors compatible with eukaryotic cells, preferably those compatible 
with vertebrate cells, can also be used to form a rDNA molecule that contains a coding 
sequence. Eukaryotic cell expression vectors are well known in the art and are 
available from several commercial sources. Typically, such vectors are provided 
containing convenient restriction sites for insertion of a desired DNA segment. Typical 
of such vectors are pSVL and pKSV-10 (Pharmacia), pBPV-1/pML2d (International 
Biotechnologies, Inc.), vector systems that include Histidine Tags and periplasmic 
secretion, or other vectors described in the art. 

Eukaryotic cell expression vectors used to construct the rDNA molecules of the 
present invention may further include a selectable marker that is effective in an 
eukaryotic cell, preferably a drug resistance selection marker. A preferred drug 
resistance marker is the gene whose expression results in neomycin resistance, i.e., 
the neomycin phosphotransferase (neo) gene (Southern et a/., J. Mol Anal. Genet 
1:327-341 (1982)). Alternatively, the selectable marker can be present on a separate 
plasmid, and the two vectors introduced by co-transfection of the host cell, and selected 
by culturing in the appropriate drug for the selectable marker. 
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8. Host Cells Containing an Exogenously Supplied rDNA Nucleic Acid 
Molecule 

The present invention further provides host cells transformed with a nucleic acid 
molecule that encodes a polypeptide or protein of the present invention. The host cell 
can be either prokaryotic or eukaryotic. Eukaryotic cells useful for expression of a 
protein of the invention are not limited, so long as the cell line is compatible with cell 
culture methods and compatible with the propagation of the expression vector and 
expression of the gene product. Preferred eukaryotic host cells include, but are not 
limited to, yeast, insect and mammalian cells, preferably vertebrate cells such as those 
from a mouse, rat, monkey or human cell line but also can include invertebrates with, 
for example, cartilage. Preferred eukaryotic host cells include but are not limited to 
Chinese hamster ovary (CHO) cells (ATCC No. CCL61), NIH Swiss mouse embryo 
cells NIH/3T3 (ATCC No. CRL 1658), baby hamster kidney cells (BHK), HOB-03-CE6 
osteoblast cells, and other like eukaryotic tissue culture cell lines. 

Any prokaryotic host can be used to express a rDNA molecule encoding a 
protein of the invention. A preferred prokaryotic host is E. coli. 

Transformation of appropriate cell hosts with a recombinant DNA (rDNA) 
molecule of the present invention is accomplished by well known methods that typically 
depend on the type of vector used and host system employed. With regard to 
transformation of prokaryotic host cells, electroporation and salt treatment methods are 
typically employed; see, for example, Cohen et a/., Proc. Natl. Acad. Sci. USA 69: 21 10 
(1972); Maniatis etal. (1982); and Sambrook et ai (1989). With regard to 
transformation of vertebrate cells with vectors containing rDNAs, electroporation, 
cationic lipid or salt treatment methods are typically employed; see, for example, 
Graham et a/., Virol 52: 456 (1973); Wigler et a/., Proc. Natl. Acad. Sci. USA 76: 1373- 
76 (1979). 

Successfully transformed cells, i.e., cells that contain a rDNA molecule of the 
present invention, can be identified by well known techniques including the selection for 
a selectable marker. For example, cells resulting from the introduction of an rDNA of 
the present invention can be cloned to produce single colonies. Cells from those 
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colonies can be harvested, lysed and their DNA content examined for the presence of 
the rDNA using a method such as that described by Southern, J. Mol. Biol. 98: 503 
(1975), or Berent et al., Biotech. 3: 208 (1985). Alternatively, the cells can be cultured 
to produce the proteins encoded by the rDNA and the proteins harvested and assayed, 
5 using for example, any suitable immunological method. See, e.g., Harlow et al., (1988). 

Recombinant DNA can also be utilized to analyze the function of coding and 
non-coding sequences. Sequences that modulate the translation of the mRNA can be 
utilized in an affinity matrix system to purify proteins obtained from cell lysates that 
associate with the Dkk-1 or Dkk-1 interacting protein or expression control sequence. 
1 0 Synthetic oligonucleotides would be coupled to the beads and probed with the lysates, 
as is commonly known in the art. Associated proteins could then be separated using, 
for example, a two dimensional SDS-PAGE system. Proteins thus isolated could be 
further identified using mass spectroscopy or protein sequencing. Additional methods 
would be apparent to the skilled artisan. 

15 

m 

9. Production of Recombinant Peptides and Proteins using a cDNA or Other 
Recombinant Nucleic Acids 

The invention also relates to nucleic acid molecules which encode a Dkk protein 
and polypeptide fragments thereof, and proteins and polypeptides which bind to Dkk 

20 (e.g., LRP5, LRP6 and HBM, Dkk interacting proteins such as the proteins of Figure 5) 
and molecular analogues. The polypeptides of the present invention include the full 
length Dkk and polypeptide fragments thereof, Dkk binding proteins and polypeptides 
thereof. Preferably these proteins are mammalian proteins, and most preferably 
human proteins and biologically active fragments thereof. Alternative embodiments 

25 include nucleic acid molecules encoding polypeptide fragments having a consecutive 
amino acid sequence of at least about 3, 5, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50, 60, 70, 
80, 90, 100, 125, 150, 175, or 200 amino acid residues from a common polypeptide 
sequence; amino acid sequence variants of a common polypeptide sequence wherein 
an amino acid residue has been inserted N- or C-terminal to, or within, the polypeptide 

30 sequence or its fragments; and amino acid sequence variants of the common 
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polypeptide sequence or its fragments, which have been substituted by another 
conserved residue. Recombinant nucleic acid molecules which encode polypeptides 
include those containing predetermined mutations by, e.g., homologous recombination, 
site-directed or PCR mutagenesis, and recombinant Dkk proteins or polypeptide 
5 fragments of other animal species, including but not limited to vertebrates (e.g., rabbit, 
rat, murine, porcine, camelid, reptilian, caprine, avian, fish, bovine, ovine, equine and 
non-human primate species) as well as invertebrates, and alleles or other naturally 
occurring variants and homologs of Dkk binding proteins of the foregoing species and 
of human sequences. Also contemplated herein are derivatives of the commonly 

10 known Dkk, Dkk interacting proteins, or fragments thereof, wherein Dkk, Dkk 

interacting proteins, or their fragments have been covalently modified by substitution, 
chemical, enzymatic, or other appropriate means with a moiety other than a naturally 
occurring amino acid (for example a detectable moiety such as an enzyme or 
radioisotope) and soluble forms of Dkk. It is further contemplated that the present 

15 invention also includes nucleic acids with silent mutations which will hybridize to the 
endogenous sequence and which will still encode the same polypeptide. 

The nucleic acid molecules encoding Dkk binding proteins, the LRP5 binding 
domain fragment of Dkk, or other polypeptides of the present invention are preferably 
those which share a common biological activity (e.g., mediate Dkk activity such as its 

20 interaction with LRP5, HBM or LRP6). The polypeptides of the present invention 
include those encoded by a nucleic acid molecule with silent mutations, as well as 
those nucleic acids encoding a biologically active protein with conservative amino acid 
substitutions, allelic variants, and other variants of the disclosed polypeptides which 
maintain at least one Dkk activity. 

25 The amino acid compounds of the invention are polypeptides which are partially 

defined in terms of amino acid residues of designated classes. Polypeptide homologs 
would include conservative amino acid substitutions within the amino acid classes 
described below. Amino acid residues can be generally sub-classified into four major 
subclasses as follows: 
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Acidic: The residue has a negative charge due to loss of H + ion at physiological 
pH, and the residue is attracted by aqueous solution so as to seek the surface positions 
in the conformation of a peptide in which it is contained when the peptide is in aqueous 
medium, at physiological pH. 
5 Basic: The residue has a positive charge due to association with H + ion at 

physiological pH, and the residue is attracted by aqueous solution so as to seek the 
surface positions in the conformation of a peptide in which it is contained when the 
peptide is in aqueous medium at physiological pH. 

Neutral/non-polar: The residues are not charged at physiological pH, but the 
1 0 residue is repelled by aqueous solution so as to seek the inner positions in the 
conformation of a peptide in which it is contained when the peptide is in aqueous 
medium. These residues are also designated "hydrophobic." 

Neutral/polar: The residues are not charged at physiological pH, but the residue 
is attracted by aqueous solution so as to seek the outer positions in the conformation of 
15 a peptide in which it is contained when the peptide is in aqueous medium. 

It is understood, of course, that in a statistical collection of individual residue 
molecules some molecules will be charged, and some not, and there will be an 
attraction for or repulsion from an aqueous medium to a greater or lesser extent. To fit 
the definition of "charged", a significant percentage (at least approximately 25%) of the 
20 individual molecules are charged at physiological pH. The degree of attraction or 
repulsion required for classification as polar or nonpolar is arbitrary and, therefore, 
amino acids specifically contemplated by the invention have been classified as one or 
the other. Most amino acids not specifically named can be classified on the basis of 
known behavior. 

25 Amino acid residues can be further subclassified as cyclic or noncyclic, and 

aromatic or non-aromatic, self-explanatory classifications with respect to the side chain 
substituent groups of the residues, and as small or large. The residue is considered 
small if it contains a total of 4 carbon atoms or less, inclusive of the carboxyl carbon. 
Small residues are, of course, always nonaromatic. 



73 



WO 02/092015 



PCT/US02/15982 



The gene-encoded secondary amino acid proline, although technically within the 
group neutral/nonpolar/large/cyclic and nonaromatic, is a special case due to its known 
effects on the secondary conformation of peptide chains, and is not, therefore, included 
in this defined group. 

Other amino acid substitutions of those encoded in the gene can also be 
included in peptide compounds within the scope of the invention and can be classified 
within this general scheme according to their structure. 

All of the compounds of the invention may be in the form of the pharmaceutical^ 
acceptable salts or esters. Salts may be, for example, Na\ K\ Ca* 2 , Mg +2 and the like; 
the esters are generally those of alcohols of 1-6 carbons. 

The present invention further provides methods for producing a protein of the 
invention using nucleic acid molecules herein described. In general terms, the 
production of a recombinant form of a protein typically involves the following steps. 

First, a nucleic acid molecule is obtained that encodes Dkk, such as a nucleic 
acid molecule encoding human Dkk or any other Dkk sequence, or that encodes a Dkk 
binding protein, a Dkk aptamer or a biologically active fragment thereof. Particularly for 
Dkk binding peptides, the nucleotides encoding the peptide are incorporated into a 
nucleic acid in the form of an in-frame fusion, insertion into or appended to a thioredoxin 
coding sequence. The coding sequence (ORF) is directly suitable for expression in any 
host, as it is not interrupted by introns. 

These DNAs can be transfected into host cells such as eukaryotic cells or 
prokaryotic cells. Eukaryotic hosts include mammalian cells and vertebrate (e.g., 
osteoblasts, osteosarcoma cell lines, Drosophila S2 cells, hepatocytes, tumor cell lines 
and other bone cells of any mammal, as well as insect cells, such as Sf9 cells using 
recombinant baculovirus). For example, a DNA expressing an open reading frame 
(ORF) under control of a type I collagen promoter, or such osteoblast promoters as 
osteocalcin histone, type I collagen, TGF01, MSX2, cfos/cJun and Cbfal, can be used 
to regulate the Dkk in animal cells. Alternatively, the nucleic acid can be placed 
downstream from an inducible promoter, which can then be placed into vertebrate or 
invertebrate cells or be used in creating a transgenic animal model. 
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Alternatively, proteins and polypeptides of the present invention can be 
expressed in an heterologous system. The human cell line GM637, SV-40 transformed 
human fibroblasts, can be transfected, with a plasmid containing a Dkk ligand binding 
domain coding sequence under the control of the chicken actin promoter (Reis et a/., 
EMBO J. 1 1 : 185-193 (1992)). Such transfected cells could be used as a source of 
Dkk binding domain in functional assays. Alternatively, polypeptides encoding only a 
portion of Dkk or any of the disclosed Dkk binding peptides Dkk aptamers or a 
polypeptide encoding a Dkk interacting protein can be expressed alone or in the form of 
a fusion protein. For example, Dkk derived peptides can be expressed in bacteria (e.g., 
£ coli) as GST- or His-Tag fusion proteins. These fusion proteins are then purified and 
can be used to generate polyclonal antibodies or can be used to identify other Dkk 
ligands. 

The nucleic acid coding sequence is preferably placed in operable linkage with 
suitable control sequences, as described above, to form an expression unit containing 
the protein encoding open reading frame. The expression unit is used to transform a 
suitable host and the transformed host is cultured under conditions that allow the 
production of the recombinant protein. Optionally the recombinant protein is isolated 
from the medium or from the cells; recovery and purification of the protein may not be 
necessary in some instances where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of ways. For example, the 
desired coding sequences may be obtained from genomic fragments and used directly 
in appropriate hosts. The construction of expression vectors that are operable in a 
variety of hosts is accomplished using appropriate replicons and control sequences, as 
set forth above. The control sequences, expression vectors, and transformation 
methods are dependent on the type of host cell used to express the gene and were 
discussed in detail earlier. Suitable restriction sites can, if not normally available, be 
added to the ends of the coding sequence so as to provide an excisable gene to insert 
into these vectors. A skilled artisan can readily adapt any host/expression system 
known in the art for use with the nucleic acid molecules of the invention to produce 
recombinant protein. 
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10. Methods to Identify Binding Partners 

Another embodiment of the present invention provides methods for use in 
isolating and identifying binding partners of Dkk or Dkk interacting proteins. Dkk or a 
Dkk interacting protein or a polypeptide fragment thereof can be mixed with a potential 
5 binding partner or an extract or fraction of a cell under conditions that allow the 

association of potential binding partners with Dkk or with Dkk interacting proteins. After 
mixing, the peptides, polypeptides, proteins or other molecules that have become 
associated with Dkk or a Dkk interacting protein are separated from the mixture. The 
binding partner that bound to the polypeptide then can be purified and further analyzed. 

1 0 Determination of binding partners of Dkk and Dkk interacting proteins as well as agents 
which prevent the interaction of Dkk with one of its interacting proteins (e.g., LRP5, 
LRP6, HBM, or those proteins listed in Figure 5) can be performed using a variety of 
different competition assays as are known in the art. For example, the minimal 
sequence of Dkk, as described herein, can be used to identify antibodies which 

1 5 compete with LRP5 (or LRP6, HBM or other ligand binding partners) for binding to Dkk- 
1 and vice versa. The minimal Dkk sequence can be bound to the bottom of a 96-well 
plate (or other solid substrate), and antibodies or other potential binding agents (e.g., 
polypeptides, mimetics, homologs, antibody fragments and the like) can be screened in 
a competition assay to identify agents with binding affinities, for example, greater than 

20 the natural ligand binding partner of Dkk. 

In the present invention, suitable cells are used for preparing assays, for the 
expression of a LRP and/or Dkk or proteins that interact therewith. The cells may be 
made or derived from mammals, yeast, fungi, or viruses. A suitable cell for the 
purposes of this invention is one that includes but is not limited to a cell that can exhibit 

25 a detectable Dkk-LRP (or HBM) interaction, and preferably, the differential interaction 
between Dkk-1-LRP5 and Dkk-1-HBM. For the desired assay, the cell type may vary. 
In several embodiments, bone cells are preferred, for example, a human osteoblast cell 
(e.g. hOB-03-CE6) or osteosarcoma cell ( e.g. U20S). Additional hOB cells are hOB- 
03-C5, hOB-02-02 and, an immortalized pre-osteocytic cell line referred to as hOB-01- 

30 C1-PS-09 cells (which are deposited with American Type Culture Collection in 
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Manassas, Va. with the designation PTA-785), Examples of osteosarcoma cells would 
include SaoS2, MG63 and HOS TE85 Immortalized refers to a substantially 
continuous and permanently established cell culture with substantially unlimited cell 
division potential. That is, the cells can be cultured substantially indefinitely, i.e., for at 
least about 6 months under rapid conditions of growth, preferably much longer under 
slower growth conditions, and can be propagated rapidly and continually using routine 
cell culture techniques. Alternatively stated, preferred cells can be cultured for at least 
about 100, 150 or 200 population doublings. These cells produce a complement of 
proteins characteristic of normal human osteoblastic cells and are capable of 
osteoblastic differentiation. They can be used in cell culture studies of osteoblastic cell 
sensitivity to various agents, such as hormones, cytokines, and growth factors, or in 
tissue therapy. Certain non bone cells such as HEK 293 cells that exhibit detectable 
Dkk-LRP (or HBM) interaction are also be useful for the assays of this invention. 

To identify and isolate a binding partner, the entire Dkk protein (e.g., human 
Dkk-1, GenBank Accession No. BAA34651) or a Dkk interacting protein (Genbank 
Accession Nos. for some Dkk-1 interacting proteins are given in Figure 5) can be used. 
Alternatively, a polypeptide fragment of the protein can be used. Suitable fragments of 
the protein include at least about 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 
65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150 or more contiguous amino acid 
residues of any Dkk or Dkk interactor sequence. Preferable sequences of Dkk include 
portions or all of one or both of the cysteine rich domains (e.g., Cys-1 and Cys-2 of 
Dkk-1 ) or the conserved sequences at the amino terminus of Dkk-1 (See Krupnik et a/., 
Gene 238: 301-313 (1999)). Alternatively, portions of LRP5, LRP6, HBM and other Dkk 
interacting proteins such as those in Figure 5 that interact with Dkk-1 can be used to 
identify and isolate agents which modulate Dkk activity. Alternatively, peptide aptamers 
of LRP5, LRP6, HBM, Dkk and other Dkk interacting proteins such as those in Figure 5 
that interact with Dkk-1 can be used to identify and isolate agents which modulate Dkk 
activity. 

As used herein, a cellular extract refers to a preparation or fraction which is 
made from a lysed or disrupted cell. A variety of methods can be used to obtain cell 
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extracts. Cells can be disrupted using either physical or chemical disruption methods. 
Examples of physical disruption methods include, but are not limited to, sonication and 
mechanical shearing. Examples of chemical lysis methods include, but are not limited 
to, detergent lysis and enzyme lysis. A skilled artisan can readily adapt methods for 
5 preparing cellular extracts in order to obtain extracts for use in the present methods. 

Once an extract of a cell is prepared, the extract is mixed with the protein of the 
invention under conditions in which association of the protein with the binding partner 
can occur. A variety of conditions can be used, the most preferred being conditions 
that closely resemble conditions found in the cytoplasm of a human cell. Features such 

10 as osmolarity, pH, temperature, and the concentration of cellular extract used, can be 
varied to optimize the association of the protein with the binding partner. 

After mixing under appropriate conditions, the bound complex is separated from 
the mixture. A variety of techniques can be utilized to separate the mixture. For 
example : antibodies specific to a protein of the invention can be used to 

15 immunoprecipitate the binding partner complex. Alternatively, standard chemical 

separation techniques such as chromatography and density/sediment centrifugation 
can be used. For example, a protein of the invention is expressed with an affinity tag 
such as a His tag. The His labeled protein and any bound molecule may be retained 
and selectively eluted from a Ni-NTA column. 

20 After removal of non-associated cellular constituents found in the extract, the 

binding partner can be dissociated from the complex using conventional methods. For 
example, dissociation can be accomplished by altering the salt concentration or pH of 
the mixture. 

To aid in separating associated binding partner pairs from the mixed extract, the 
25 protein of the invention can be immobilized on a solid support. For example, the protein 
can be attached to a nitrocellulose matrix or acrylic beads. Attachment of the protein to 
a solid support aids in separating peptide/binding partner pairs from other constituents 
found in the extract. The identified binding partners can be either a single protein or a 
complex made up of two or more proteins. 
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Alternatively, the nucleic acid molecules of the invention can be used in a Y2H 
system. The Y2H system has been used to identify other protein partner pairs and can 
readily be adapted to employ the nucleic acid molecules herein described. Methods of 
performing and using Y2H systems are known. See, e.g., Finley et a/., "Two-Hybrid 
5 Analysis of Genetic Regulatory Networks," in The Yeast Two-Hvbrid System (Paul L. 
Bartel et al., eds., Oxford, 1997); Meijia Yang, "Use of a Combinatorial Peptide Library 
in the Two-Hybrid Assay," in The Yeast Two-Hvbrid System (Paul L. Bartel et al., eds., 
Oxford, 1997); Gietz et a/., "Identification of proteins that interact with a protein of 
interest: Applications of the yeast two-hybrid system," Mol. & Cell. Biochem. 172: 67-9 

10 (1997); K. H. Young, "Yeast Two-Hybrid: So Many lnteractions,(in) so Little Time," Biol. 
Reprod. 58: 302-311 (1998); R. Brent et a/., "Understanding Gene and Allele Function 
with Two-Hybrid Methods," Annu. Rev. Genet. 31:663-704 (1997) and U.S. Patent No. 
5,989,808. The Dkk-1 interacting proteins identified in Figure 5 were identified using the 
Y2H interacting system using Dkk-1 as bait. 

1 5 One preferred in vitro binding assay for Dkk modulators would comprise a 

mixture of a LRP binding domain of Dkk and one or more candidate binding targets or 
substrates. After incubating the mixture under appropriate conditions, one would 
determine whether Dkk or a fragment thereof bound with the candidate modulator 
present. For cell-free binding assays, one or more of the components usually 

20 comprises or is coupled to a label. The label may provide for direct detection, such as 
radioactivity, luminescence, optical or electron density, etc., or indirect detection such 
as an epitope tag, an enzyme, etc. A variety of methods may be employed to detect 
the label depending on the nature of the label and other assay components. For 
example, the label may be detected bound to the solid substrate or a portion of the 

25 bound complex containing the label may be separated from the solid substrate, and the 
label thereafter detected. Fluorescence resonance energy transfer may be utilized to 
monitor the interaction of two labeled molecules. For example, a fluorescence label on 
Dkk and another label on LRP5 or a soluble fragment thereof such as the extracellular 
domain will exchange fluorescence resonance energy when in close proximity 

30 indicating that the two molecules are bound. A preferred binding partner for Dkk will 
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increase or decrease the affinity between Dkk and LRP5 which will be readily 
observable in a fluorescence spectrometer. Alternatively, an instrument, such as a 
surface plasmon resonance detector manufactured by BIAcore (Uppsala, Sweden), 
may be used to observe interactions with a fixed target. One skilled in the art knows of 
5 many other methods which may be employed for this purpose. 

Thereby, the present invention provides methods for screening candidates 
including polypeptides of the present invention for activity which identifies these 
candidates as valuable drug leads. Other suitable methods are also known in the art 
and are suitable for use herein, including Xenopus oocyte injection studies and TCF 

1 0 luciferase assays. 

Additional assays can be used to identify the activity of Dkk and Dkk interacting 
proteins in the Wnt pathway, as well as the impact of modulators of Dkk and Dkk 
interacting proteins on the Wnt pathway. These include, for example, a Xenopus 
embryo assay and a TCF-luciferase reporter gene assay to monitor Writ signaling 

15 modulation. 

Xenopus embryos are an informative in vivo assay system to evaluate the 
modulation of Wnt signaling. Ectopic expression of certain Wnts or other activators of 
the Wnt signaling pathway results in a bifurcation of the anterior neural plate. This 
bifurcation results in a duplicated body axis, which suggests a role for Wnt signaling 

20 during embryonic development (McMahon et a/., Cell 58: 1075-84 (1989); Sokol et a/., 
Cell 67: 741-52 (1991)). Since these original observations, the Xenopus embryo assay 
has been extensively used as an assay system for evaluating modulation of the Wnt 
signaling pathway. One preferred embodiment of the present invention is 
demonstrated in Example 6. 

25 Constructs for Xenopus expression can be prepared as would be known in the 

art. For example, a variety of cDNAs have been engineered into the vector pCS2+ 
(Turner et a/., Genes Devel 8: 1434-1447 (1994)) to facilitate the in vitro generation of 
mRNA for use in Xenopus embryo injection experiments. DNA inserts are subcloned in 
the sense orientation with respect to the vector SP6 promoter. Downstream of the 

30 insert, the vector provides an SV40 virus polyadenlylation signal and a T3 promoter 
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sequence (/.e., for the generation of antisense mRNA). Constructs can be generated 
for various Dkk family members, LRP5, LRP6, HBM, Dkk-1 interactors, etc. Constructs 
could be generated in pCS2 + that contain the nucleic acid sequence encoding for the 
peptide aptamers that were identified in yeast screens. These sequences would be 
fused to a 5' synthetic translation initiation sequence followed by a canonical signal 
sequence to ensure that the peptide aptamer would be translated and secreted from 
the cell. 

Once these constructs are made then mRNA can be synthesized and injected 
into Xenopus oocytes. mRNA for microinjection into Xenopus embryos is generated by 
in vitro transcription using the cDNA constructs in the pCS2* vector described above as 
template. Various amounts of RNA can be injected into the ventral blastomere of the 4- 
or 8-cell Xenopus embryo substantially as described in Moon et a/., Technique-J. of 
Methods in Cell and MoL Biol. 1: 76-89 (1989), and Peng, Meth. Cell. Biol. 36: 657-62 
(1991). 

Previous data has shown that expression of LRP5, in the presence of Wnt5a, 
results in a Wnt-induced duplicated axis formation in Xenopus embryos (Tamai et a/., 
Nature 407: 530-535 (2000)). The roles of Dkk-1 and Dkk-2, and Dkk-1 interacting 
proteins, in modulating the LRP5-mediated Wnt response in vivo can be analyzed 
using, for example, the Xenopus embryo. In addition, the peptide aptamers, Dkk 
interacting proteins, or combinations of the above can be evaluated in a similar manner. 

Experiments can also be conducted wherein RNA is injected into the dorsal 
blastomere to ensure the specificity of the observed phenotypes. Lineage tracing 
experiments can be performed where a marker gene such as green fluorescent protein 
(GFP) or LacZ is co-injected with the experimental RNAs. Detecting marker gene 
expression would identify the targeted cells of the microinjection and aid in elucidating 
the mechanism of action. In addition to the Wnt signaling components listed above, the 
point at which HBM acts upon the Wnt pathway can also be analyzed. This can be 
done by co-injections of various dominant-negative constructs. For example, a 
dominant negative TCF-3 construct would be useful to demonstrate that the observed 
axis duplication (and Wnt activation) is mediated via the (5-catenin-TCF response. If so, 
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such a construct would be expected to abolish the observed duplicated axis phenotype. 
Another example would include a dominant negative Dsh construct. Since Dsh is far 
upstream in the Wnt signaling pathway, a dominant negative construct should abolish 
the activation of the Wnt response and the observed axis duplication. If it does not, this 
would suggest that axis duplication is being induced via a different signaling pathway. 

The marker genes of the injected Xenopus embryos can be analyzed as follows. 
Representative embryos are collected at stage 10.5 (1 1 hours post fertilization) for 
marker gene analysis. RNA is extracted and purified from the embryos following 
standard protocols (Sambrook ef a/., 1989 at 7.16). Marker genes could include the 
following: Siamois (i.e., Wnt responsive gene), Xnr3 (i.e., Wnt responsive gene), slug 
(i.e., neural crest marker), Xbra (i.e., early mesoderm marker), HNK-1 (/.e. f 
ectodermal/neural marker), endodermin (i.e., endoderm), Xlhbox8 (i.e., pancreatic), 
BMP2 and BMP4 (i.e., early mesoderm), XLRP6 (i.e., maternal and zygotic expression, 
it is also the LRP6 homolog in the frog), EF-1 (i.e., control) and ODC (i.e., control). 
Induction of marker genes is analyzed and quantitated by RT-PCR/TaqMan®. 

This type of marker analysis is excellent to monitor changes in gene expression 
that result very early in the embryo as a direct result of signaling perturbation. Other 
experiments could be designed that would monitor changes in gene expression in a 
more tissue or spatially-restricted fashion. Examples would include the generation of a 
transgenic Xenopus model. For example, Zmax/LRP5 and HBM expression could be 
under the control of the brachyury or cardiac-actin promoters directing gene expression 
transiently in the mesoderm or in the somites, respectively. Phenotype analyses of 
these transgenic Xenopus animals would include marker gene analysis/transcriptional 
profiling (from a restricted tissue source) and histologic examination of the tissue. 

A TCF-luciferase assay system such as that described in Example 7 can also be 
used to monitor Wnt signaling activity, Dkk activity and Dkk interacting protein activity. 
Constructs for the TCF-luciferase assays can be prepared as would be known in the 
art. For example, Dkk and Dkk interacting protein peptides, LRP5/LRP6, among 
others, can be expressed in pcDNA3.1, using Kozak and signal sequences to target 
peptides for secretion. 
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Once constructs have been prepared, cells such as osteoblasts and HEK293 
cells are seeded in well plates and transfected with construct DNA, CMV beta- 
galactosidase plasmid DNA, and TCF-luciferase reporter DNA. The cells are then 
(ysed and assayed for beta-galactosidase and luciferase activity to determine whether 
Dkk, Dkk interacting proteins, or other molecules such as antibodies affect Wnt 
signaling. 

Additional assays for monitoring Wnt signaling activity, Dkk activity, and Dkk 
interacting protein activity include: 

Modulation of another Wnt-responsive transcription factor, LEF, as 
visualized by a reporter gene activity. One example includes the activation of 
the LEF1 promoter region fused to the luciferase reporter gene (Hsu et a/., Mol. 
Cell. Biol. 1 8: 4807-1 8 (1 999)). 

Alterations in cell proliferation, cell cycle or apoptosis. There are 
numerous examples describing Wnt-mediated cellular transformations including 
Shimizu et a/., Cell. Growth Differ. 8: 1349-58 (1997). 

Stabilization and cellular localization of de-phosphorylated (3-catenin as 
an indicator of Wnt activation (Shimizu et al., 1997). 
Additional methods of assaying Wnt signaling, through either the canonical or 
non-canonical pathways, would be apparent to the artisan of ordinary skill. 

11. Methods to Identify Agents that Modulate the Expression of a Nucleic Acid 
Encoding the Dkk and/or LRP5 Proteins and/or Dkk interacting proteins 

Another embodiment of the present invention provides methods for identifying 
agents that modulate the expression of a nucleic acid encoding Dkk. Such assays may 
utilize any available means of monitoring for changes in the expression level of the 
nucleic acids of the invention. As used herein, an agent is said to modulate the 
expression of Dkk, if it is capable of up- or down-regulating expression of the nucleic 
acid in a cell (e.g., mRNA). 

In one assay format, cell lines that contain reporter gene fusions between the 
nucleic acid encoding Dkk (or proteins which modulate the activity of Dkk) and any 



83 



WO 02/092015 



PCT/US02/15982 



assayable fusion partner may be prepared. Numerous assayable fusion partners are 
known and readily available, including but not limited to the firefly luciferase gene and 
the gene encoding chloramphenicol acetyltransferase (Alam et a/., Anal. Biochem. 188: 
245-254 (1990)). Cell lines containing the reporter gene fusions are then exposed to 
the agent to be tested under appropriate conditions and time. Differential expression of 
the reporter gene between samples exposed to the agent and control samples identifies 
agents which modulate the expression of a nucleic acid encoding Dkk or other protein 
which modulates Dkk activity. Such assays can similarly be used to determine whether 
LRP5 and even LRP6 activity is modulated by regulating Dkk activity. 

Additional assay formats may be used to monitor the ability of the agent(s) to 
modulate the expression of a nucleic acid encoding Dkk, alone or Dkk and LRP5, 
and/or Dkk interacting proteins such as those identified in Figure 5. For instance, 
mRNA expression may be monitored directly by hybridization to the nucleic acids of the 
invention. Cell lines are exposed to the agent to be tested under appropriate conditions 
and time and total RNA or mRNA is isolated by standard procedures such those 
disclosed in Sambrook etal. (1989); Ausubel et al. t Current Protocols in Molecular 
Biology (Greene Publishing Co., NY, 1995); Maniatis et a/. F Molecular Cloning: A 
Laboratory Manual (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1982); 
and Short Protocols in Molecular Biologv: A Compendium of Methods from Current 
Protocols in Molecular Biology (Frederick M. Ausubel et a/., April 1999). 

Probes to detect differences in RNA expression levels between cells exposed to 
the agent and control cells may be prepared from the nucleic acids of the invention. It 
is preferable, but not necessary, to design probes which hybridize only with target 
nucleic acids under conditions of high stringency. Only highly complementary nucleic 
acid hybrids form under conditions of high stringency. Accordingly, the stringency of 
the assay conditions determines the amount of complementarity which should exist 
between two nucleic acid strands in order to form a hybrid. Stringency should be 
chosen to maximize the difference in stability between the probe:target hybrid and 
potential probe: non-target hybrids. 
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Probes may be designed from the nucleic acids of the invention through 
methods known in the art. For instance, the G+C content of the probe and the probe 
length can affect probe binding to its target sequence. Methods to optimize probe 
specificity are commonly available. See for example, Sambrook et al. (1989) or 
Ausubel era/. ( Current Protocols in Molecular Biology. Greene Publishing Co., NY, 
1995). 

Hybridization conditions are modified using known methods, such as those 
described by Sambrook et al. (1989) and Ausubel et al. (1995), as suitable for each 
probe. Hybridization of total cellular RNA or RNA enriched for polyA RNA can be 
accomplished in any available format. For instance, total cellular RNA or RNA enriched 
for polyA RNA can be affixed to a solid support and the solid support exposed to at 
least one probe comprising at least one, or part of one of the nucleic acid sequences of 
the invention under conditions in which the probe will specifically hybridize. 
Alternatively, nucleic acid fragments comprising at least one, or part of one of the 
sequences of the invention can be affixed to a solid support, such as a porous glass 
wafer. The glass or silica wafer can then be exposed to total cellular RNA or polyA 
RNA from a sample under conditions in which the affixed sequences will specifically 
hybridize. Such glass wafers and hybridization methods are widely available, for 
example, those disclosed by Beattie (WO 95/1 1755). By examining for the ability of a 
given probe to specifically hybridize to an RNA sample from an untreated cell 
population and from a cell population exposed to the agent, agents which up- or down- 
regulate the expression of a nucleic acid encoding Dkk, a Dkk interacting protein, 
and/or LRP5 can be identified. 

Microarray technology and transcriptional profiling are examples of methods 
which can be used to analyze the impact of putative Dkk or Dkk interacting protein 
modulating compounds. For transcriptional profiling, mRNA from cells exposed in vivo 
to a potential Dkk modulating agent, such as the Dkk interacting proteins identified in 
the present invention (e.g., those identified in Figure 5), agents which modulate Dkk 
interacting proteins, and mRNA from the same type of cells that were not exposed to 
the agent could be reverse transcribed and hybridized to a chip containing DNAfrom 
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numerous genes, to thereby compare the expression of genes in cells treated and not 
treated with the agent. If, for example a putative Dkk modulating agent down-regulates 
the expression of Dkk in the cells, then use of the agent may be undesirable in certain 
patient populations. For additional methods of transcriptional profiling and the use of 
5 microarrays, refer to, for example, U.S. Patent No. 6,124,120 issued to Lizardi (2000). 

Additional methods for screening the impact of Dkk and Dkk interacting protein 
modulating compounds or the impact of Dkk or Dkk interacting proteins on modulation 
of LRP5, LRP6, HBM or the Wnt pathway include the use of TaqMan PCR, 
conventional reverse transcriptase PCR (RT-PCR), changes in downstream surrogate 
10 markers {i.e., Wnt responsive genes), and anti-Dkk Western blots for protein detection. 
Other methods would be readily apparent to the artisan of ordinary skill. 

12. Methods to Identify Agents that Modulate at Least One Activity of Dkk. a 
Dkk Interacting Protein, or LRP5/LRP6/HBM 

1 5 Another embodiment of the present invention provides methods for identifying 

agents that modulate at least one activity of Dkk, Dkk interacting proteins, and/or 
LRP5/LRP6/HBM proteins or preferably which specifically modulate an activity of a 
Dkk/Dkk interacting protein complex or an LRP5(or LRP6/HBM)/Dkk complex, or a 
biologically active fragment of Dkk (e.g., comprising the domain which binds 

20 LRP5/LRP6/HBM) or a Dkk interacting protein complex. Such methods or assays may 
utilize any means of monitoring or detecting the desired activity as would be known in 
the art (See, e.g., Wu et a/., Curr. Biol. 10:1611-4 (2000); Fedi et a/., J. Biol. Chem. 
274:19465-72 (1991); Grotewold et a/., Mech. Dev. 89:151-3 (1999); Shibata ef a/., 
Mech. Dev. 96:243-6 (2000); Wang etal. t Oncogene 19:1843-8 (2000); and Glinka et 

25 a/., Nature 391:357-62 (1998)). Potential agents which modulate Dkk include, for 

example, p53, the tumor suppressor protein, which can induce Dkk-1 . Damage to DNA 
has also been observed to up-regulate Dkk-1 expression via a stabilization and 
activation of p53 (Wang et a/., Oncogene 19:1843-48 (2000)); and, Shou et a/., 
Oncogene 21:878-89 (2002)). Additionally, Fedi et al. (1999) purportedly showed that 

30 Dkk-1 can block the Wnt2-induced oncogenic transformation of NIH-3T3 cells. 
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Furthermore, it has been suggested that Dkk expression can be modulated by BMP 
signaling in the developing skeleton (Mukhopadhyay et al. , Dev. Cell. 1 :423-34 (2001 ); 
and Grotewold et a/., EMBO J. 21 :966-75 (2002)). Grotewold et al. additionally 
describe altered Dkk expression levels in response to stress signals including UV 
irradiation and other genotoxic stimuli. They propose that Dkk expression is pro- 
apoptotic. In animals expressing HBM constructs conferring high bone mass, a 
reduced osteoblast apoptosis effect was observed. Thus, HBM and HBM-like variants 
may control/alter Dkk's role in programmed cell death. Other agents which potentially 
modulate Dkk activity include the Dkk interacting proteins identified in Figure 5. 

In one embodiment, the relative amounts of Dkk or a Dkk interacting protein of a 
cell population that has been exposed to the agent to be tested is compared to an un- 
exposed control cell population. Antibodies can be used to monitor the differential 
expression of the protein in the different cell populations. Cell lines or populations are 
exposed to the agent to be tested under appropriate conditions and time. Cellular 
lysates may be prepared from the exposed cell line or population and a control, 
unexposed cell line or population. The cellular lysates are then analyzed with the 
probe, as would be known in the art. See, e.g., Ed Harlow and David Lane, 
Antibodies: A Laboratory Manual (Cold Spring Harbor, NY, 1988) and Ed Harlow and 
David Lane, Using Antibodies: A Laboratory Manual (Cold Spring Harbor, NY 1998). 

For example, N- and C- terminal fragments of Dkk can be expressed in bacteria 
and used to search for proteins which bind to these fragments. Fusion proteins, such 
as His-tag or GST fusion to the N- or C-terminal regions of Dkk (or to biologically active 
domains of Dkk-1 ) or a whole Dkk protein can be prepared. These fusion proteins can 
be coupled to, for example, Talon or Glutathione-Sepharose beads and then probed 
with cell lysates to identify molecules which bind to Dkk. Prior to lysis, the cells may be 
treated with purified Wnt proteins, RNA, or drugs which may modulate Wnt signaling or 
proteins that interact with downstream elements of the Wnt pathway. Lysate proteins 
binding to the fusion proteins can be resolved by SDS-PAGE, isolated and identified by, 
for example protein sequencing or mass spectroscopy, as is known in the art. See, 
e.g., Protein Purification Applications: A Practical Approach (Simon Roe, ed., 2 nd ed. 
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Oxford Univ. Press, 2001) and "Guide to Protein Purification" in Meth. Enzymology vol. 
182 (Academic Press, 1997). 

The activity of Dkk, a Dkk interacting protein, or a complex of Dkk with 
LRP5/LRP6/HBM may be affected by compounds which modulate the interaction 
between Dkk and a Dkk interacting protein (such as those shown in Figure 5) and/or 
Dkk and LRP5/LRP6/HBM. The present invention provides methods and research 
tools for the discovery and characterization of these compounds. The interaction 
between Dkk and a Dkk interacting protein and/or Dkk and LRP5/6/HBM may be 
monitored in vivo and in vitro. Compounds which modulate the stability of a Dkk - 
LRP5/LRP6/HBM complex are potential therapeutic compounds. Example in vitro 
methods include: Binding LRP5/6/HBM, Dkk, or a Dkk interacting protein to a sensor 
chip designed for an instrument such are made by Biacore (Uppsala, Sweden) for the 
performance of an plasmon resonance spectroscopy observation. In this method, the 
chip with one of Dkk, a Dkk interacting protein, or LRP5/6 is first exposed to the other 
under conditions which permit them to form the complex. A test compound is then 
introduced and the output signal of the instrument provides an indication of any effect 
exerted by the test compound. By this method, compounds may be rapidly screened. 
Another, in vitro, method is exemplified by the SAR-by-NMR methods (Shuker ef a/., 
Science. 274:1531-4 (1996)). Briefly, a Dkk-1 binding domain and/or LRP 5 or 6 LBD 
are expressed and purified as 15 N labeled protein by expression in labeled media. The 
labeled protein(s) are allowed to form the complex in solution in an NMR sample tube. 
The heteronuclear correlation spectrum in the presence and absence of a test 
compound provides data at the level of individual residues with regard to interactions 
with the test compound and changes at the protein-protein interface of the complex. 
One of skill in the art knows of many other protocols, e.g. affinity capillary 
electrophoresis (Okun et at. J Biol Chem 276:1057-62 (2001); Vergun and Chu, 
Methods, 19:270-7 (1999)), fluorescence spectroscopy, electron paramagnetic 
resonance, etc. which can monitor the modulation of a complex and/or measure binding 
affinities for complex formation. 
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In vitro protocols for monitoring the modulation of a Dkk/LRP5/LRP6/HBM 
complex include the yeast two hybrid protocol. The yeast two hybrid method may be 
used to monitor the modulation of a complex in vivo by monitoring the expression of 
genes activated by the formation of a complex of fusion proteins of Dkk and LRP ligand 
binding domains. Nucleic acids according to the invention which encode the interacting 
Dkk and LRP LBD domains are incorporated into bait and prey plasmids. The Y2H 
protocol is performed in the presence of one or more test compounds. The modulation 
of the complex is observed by a change in expression of the complex activated gene. It 
will be appreciated by one skilled in the art that test compounds can be added to the 
assay directly or, in the case of proteins, can be coexpressed in the yeast with the bait 
and prey compounds. Similarly, fusion proteins of Dkk and Dkk interacting proteins can 
also be used in a Y2H screen to identify other proteins which modulate the Dkk/Dkk 
interacting protein complex. 

Assay protocols such as these may be used in methods to screen for 
compounds, drugs, treatments which modulate the Dkk/Dkk interacting protein and/or 
Dkk/LRP5/6 complex, whether such modulation occurs by competitive binding, or by 
altering the structure of either LRP 5/6 or Dkk at the binding site, or by stabilizing or 
destablizing the protein-protein interface. It may be anticipated that peptide aptamers 
may competitively bind, although induction of an altered binding site structure by steric 
effects is also possible. 

12.1 Antibodies and Antibody Fragments 

Polyclonal and monoclonal antibodies and fragments of these antibodies which 
bind to Dkk or LRP5/LRP6/HBM can be prepared as would be known in the art. For 
example, suitable host animals can be immunized using appropriate immunization 
protocols and the peptides, polypeptides or proteins of the invention. Peptides for use 
in immunization are typically about 8-40 residues long. If necessary or desired, the 
polypeptide immunogens can be conjugated to suitable carriers. Methods for preparing 
immunogenic conjugates with carriers such as bovine serum albumin (BSA), keyhole 
limpet hemocyanin (KLH), or other carrier proteins are well known in the art (See, 



89 



WO 02/092015 



PCT/US02/15982 



Harlow et a/., 1988). In some circumstances, direct conjugation using, for example, 
carbodiimide reagents, may be effective; in other instances linking reagents such as 
those supplied by Pierce Chemical Co., Rockford, IL, may be desirable to provide 
accessibility to the polypeptide or hapten. The hapten peptides can be extended at 
5 either the amino or carboxy terminus with a cysteine residue or interspersed with 
cysteine residues, for example, to facilitate linking to a carrier. Administration of the 
immunogens is conducted generally by injection over a suitable time period and with 
use of suitable adjuvants, as is generally understood in the art. During the 
immunization schedule, titers of antibodies are taken to determine adequacy of 

1 0 antibody formation. 

Anti-peptide antibodies can be generated using synthetic peptides, for example, 
the peptides derived from the sequence of any Dkk, including Dkk-1 , or 
LRP5/LRP6/HBM. Synthetic peptides can be as small as 2-3 amino acids in length, but 
are preferably at least 3, 5, 10, or 15 or more amino acid residues long. Such peptides 

15 can be determined using programs such as DNAStar. The peptides are coupled to 

■ 

KLH using standard methods and can be immunized into animals such as rabbits. 
Polyclonal anti-Dkk or anti-LRP5/LRP6/HBM peptide antibodies can then be purified, 
for example using Actigel beads containing the covalently bound peptide. 

While the polyclonal antisera produced in this way may be satisfactory for some 

20 applications, for pharmaceutical compositions, use of monoclonal preparations is 

preferred. Immortalized cell lines which secrete the desired monoclonal antibodies may 
be prepared using the standard method of Kohler and Milstein or modifications which 
effect immortalization of lymphocytes or spleen cells, as is generally known (See, e.g., 
Harlow et a/., 1988 and 1998). The immortalized cell lines secreting the desired 

25 antibodies can be screened by immunoassay in which the antigen is the peptide 
hapten, polypeptide or protein. When the appropriate immortalized cell culture 
secreting the desired antibody is identified, the cells can be cultured either in vitro or by 
production in ascites fluid. 

The desired monoclonal antibodies are then recovered from the culture 

30 supernatant or from the ascites supernatant. Fragments of the monoclonal antibodies 
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which contain the immunologically significant portion can be used as agonists or 
antagonists of Dkk activity. Use of immunologically reactive fragments, such as the 
Fab, scFV, Fab\ of F(ab , ) 2 fragments are often preferable, especially in a therapeutic 
context, as these fragments are generally less immunogenic than the whole 
immunoglobulin. 

The antibodies or fragments may also be produced, using current technology, by 
recombinant means. Regions that bind specifically to the desired regions of Dkk or 
LRP5/LRP6/HBM can also be produced in the context of chimeras with multiple species 
origin. Antibody reagents so created are contemplated for use diagnostically or as 
stimulants or inhibitors of Dkk activity. 

In one embodiment, antibodies against Dkk, bind Dkk with high affinity, i.e., 
ranging from 10" 5 to 10~ 9 M. Preferably, the anti-Dkk antibody will comprise a chimeric, 
primate, Primatized®, human or humanized antibody. Also, the invention embraces the 
use of antibody fragments, e.g., Fab's, Fv's, Fab's, F(ab) 2 , and aggregates thereof. 

Another embodiment contemplates chimeric antibodies which recognize Dkk or 
LRP5/LRP6/HBM. A chimeric antibody is intended to refer to an antibody with non- 
human variable regions and human constant regions, most typically rodent variable 
regions and human constant regions. 

A "primatized® antibody" refers to an antibody with primate variable regions, 
e.g., CDR's, and human constant regions. Preferably, such primate variable regions 
are derived from an Old World monkey. 

A "humanized antibody" refers to an antibody with substantially human 
framework and constant regions, and non-human complementarity-determining regions 
(CDRs). "Substantially" refers to the fact that humanized antibodies typically retain at 
least several donor framework residues (i.e., of non-human parent antibody from which 
CDRs are derived). 

Methods for producing chimeric, primate, primatized®, humanized and human 
antibodies are well known in the art. See, e.g., U.S. Patent 5,530,101 , issued to Queen 
et ai; U.S. Patent 5,225,539, issued to Winter et a/.; U.S. Patents 4,816,397 and 
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4,816,567, issued to Boss etai and Cabilly etai respectively, all of which are 
incorporated by reference in their entirety. 

The selection of human constant regions may be significant to the therapeutic 
efficacy of the subject anti-Dkk or LRP5/LRP6/HBM antibody. In a preferred 
embodiment, the subject anti-Dkk or LRP5/LRP6/HBM antibody will comprise human, 
gamma 1, or gamma 3 constant regions and, more preferably, human gamma 1 
constant regions. 

Methods for making human antibodies are also known and include, by way of 
example, production in SCID mice, and in vitro immunization. 

The subject anti-Dkk or LRP5/LRP6/HBM antibodies can be administered by 
various routes of administration, typically parenteral. This is intended to include 
intravenous, intramuscular, subcutaneous, rectal, vaginal, and administration with 
intravenous infusion being preferred. 

The anti-Dkk or LRP5/LRP6/HBM antibody will be formulated for therapeutic 
usage by standard methods, e.g., by addition of pharmaceutical^ acceptable buffers, 
e.g., sterile saline, sterile buffered water, propylene glycol, and combinations thereof. 

Effective dosages will depend on the specific antibody, condition of the patient, 
age, weight, or any other treatments, among other factors. Typically effective dosages 
will range from about 0.001 to about 30 mg/kg body weight, more preferably from about 
0.01 to 25 mg/kg body weight, and most preferably from about 0.1 to about 20 mg/kg 
body weight. 

Such administration may be effected by various protocols, e.g., weekly, bi- 
weekly, or monthly, depending on the dosage administered and patient response. Also, 
it may be desirable to combine such administration with other treatments. 

Antibodies to Dkk-1 interacting proteins, such as those identified in Figure 5, are 
also contemplated according to the present invention, and can be used similarly to the 
Dkk-1 antibodies mentioned in the above methodology. 

The antibodies of the present invention can be utilized in experimental 
screening, as diagnostic reagents, and in therapeutic compositions. 
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12.2 Chemical Libraries 

Agents that are assayed by these methods can be randomly selected or 
rationally selected or designed. As used herein, an agent is said to be randomly 
selected when the agent is chosen randomly without considering the specific 
sequences involved in the association of Dkk-1 alone, Dkk-1 interacting proteins alone, 
or with their associated substrates, binding partners, etc. An example of randomly 
selected agents is the use of a chemical library or a peptide combinatorial library, or a 
growth broth of an organism. 

The agents of the present invention can be, as examples, peptides, small 
molecules, vitamin derivatives, as well as carbohydrates. A skilled artisan can readily 
recognize that there is no limit as to the structural nature of the agents of the present 
invention. 

12.3 Peptide Synthesis 

The peptide agents of the invention can be prepared using standard solid phase 
(or solution phase) peptide synthesis methods, as is known in the art. In addition, the 
DNA encoding these peptides may be synthesized using commercially available 
oligonucleotide synthesis instrumentation and produced recombinantly using standard 
recombinant production systems. The production of polypeptides using solid phase 
peptide synthesis is necessitated if non-nucleic acid-encoded amino acids are to be 
included. 

13. Uses for Agents that Modulate at Least One Activity of Dkk. a Dkk 
Interacting Protein, a Dkk/Pkk Interacting Protein Complex, or a Dkk/LRP5 or 
Dkk/LRP6 Complex 

The proteins and nucleic acids of the invention, such as the proteins or 
polypeptides containing an amino acid sequence of LRP5, Dkk, and Dkk interacting 
proteins are involved in bone mass modulation and lipid modulation of other Wnt 
pathway mediated activity. Agents that modulate (/.e., up and down-regulate) the 
expression of Dkk or Dkk interacting proteins, or agents, such as agonists and 
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antagonists respectively, of at least one activity of Dkk or a Dkk interacting protein may 
be used to modulate biological and pathologic processes associated with the function 
and activity of Dkk or a Dkk interacting protein. 

As used herein, a subject can be preferably any mammal, so long as the 
5 mammal is in need of modulation of a pathological or biological process modulated by a 
protein of the invention. The term "mammal" means an individual belonging to the class 
Mammalia. The invention is particularly useful in the treatment of human subjects. 

As used herein, a biological or pathological process modulated by Dkk or a Dkk 
interacting protein may include binding of Dkk to a Dkk interacting protein, Dkk to 
1 0 LRP5 or LRP6 or release therefrom, inhibiting or activating Dkk or a Dkk interacting 

protein mRNA synthesis or inhibiting Dkk or Dkk interacting protein modulated inhibition 
of LRP5 or LRP6 mediated Wnt signaling. Further bone-related markers may be 
observed such as alkaline phosphatase activity, osteocalcin production, or 
mineralization. 

1 5 Pathological processes refer to a category of biological processes which 

produce a deleterious effect. For example, expression or up-regulation of expression of 
LRP5 or LRP6 and/or Dkk and/or a Dkk interacting protein may be associated with 
certain diseases or pathological conditions. As used herein, an agent is said to 
modulate a pathological process when the agent statistically significantly (p < 0.05) 

20 alters the process from its base level in the subject. For example, the agent may 

reduce the degree or severity of the process mediated by that protein in the subject to 
which the agent was administered. For instance, a disease or pathological condition 
may be prevented, or disease progression modulated by the administration of agents 
which reduce or modulate in some way the expression or at least one activity of a 

25 protein of the invention. 

As LRP5/6 and Dkk are involved both directly and indirectly in bone mass 
modulation, one embodiment of this invention is to use Dkk or Dkk interacting protein 
expression as a method of diagnosing a bone condition or disease. Certain markers 
are associated with specific Wnt signaling conditions (e.g., TCF/LEF activation). 

30 Diagnostic tests for bone conditions may include the steps of testing a sample or an 
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extract thereof for the presence of Dkk or Dkk interacting protein nucleic acids (/.e., 
DNA or RNA), oligomers or fragments thereof or protein products of TCF/LEF regulated 
expression. For example, standard in situ hybridization or other imaging techniques 
can be utilized to observe products of Wnt signaling. 
5 This invention also relates to methods of modulating bone development or bone 

loss conditions. Inhibition of bone loss may be achieved by inhibiting or modulating 
changes in the LRP5/6 mediated Wnt signaling pathway. For example, absence of 
LRP5 activity may be associated with low bone mass. Increased activity LRP5 may be 
associated with high bone mass. Therefore, modulation of LRP5 activity will in turn 

1 0 modulate bone development. Modulation of the Dkk/LRP5/6 or Dkk/Dkk interacting 
protein complex via agonists and antagonists is one embodiment of a method to 
regulate bone development. Such modulation of bone development can result from 
inhibition of the activity of, for example, a Dkk/LRP(5/6) protein complex, a Dkk/Dkk 
interacting protein complex, upregulated transcription of the LRP5 gene or inhibited 

1 5 translation of Dkk or Dkk interacting protein mRNA. 

The agents of the present invention can be provided alone, or in combination 
with other agents that modulate a particular pathological process. As used herein, two 
agents are said to be administered in combination when the two agents are 
administered simultaneously or are administered independently in a fashion such that 

20 the agents will act at the same time. 

The agents of the present invention can be administered via parenteral, 
subcutaneous (sc), intravenous (iv), intramuscular (im), intraperitoneal (ip), transdermal 
or buccal routes. Alternatively, or concurrently, administration may be by the oral route. 
The dosage administered will be dependent upon the age, health, and weight of the 

25 recipient, kind of concurrent treatment, if any, frequency of treatment, and the nature of 
the effect desired. 

The present invention further provides compositions containing one or more 
agents which modulate expression or at least one activity of a protein of the invention. 
While individual needs vary, determination of optimal ranges of effective amounts of 
30 each component is within the skill of the art. Typical dosages of the active agent which 



95 



WO 02/092015 



PCT/US02/15982 



mediate Dkk or Dkk interacting protein activity comprise from about 0.0001 to about 50 
mg/kg body weight. The preferred dosages comprise from about 0.001 to about 50 
mg/kg body weight. The most preferred dosages comprise from about 0.1 to about 1 
mg/kg body weight. In an average human of 70 kg, the range would be from about 7 
pg to about 3.5 g, with a preferred range of about 0.5 mg to about 5 mg. 

In addition to the pharmacologically active agent, the compositions of the 
present invention may contain suitable pharmaceutical^ acceptable carriers comprising 
excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutical^ for delivery to the site of action. 
Suitable formulations for parenteral administration include aqueous solutions of the 
active compounds in water-soluble form, for example, water-soluble salts. In addition, 
suspensions of the active compounds as appropriate oily injection suspensions may be 
administered. Suitable lipophilic solvents or vehicles include fatty oils, for example, 
sesame oil, or synthetic fatty acid esters, (e.g., ethyl oleate or triglycerides). Aqueous 
injection suspensions may contain substances which increase the viscosity of the 
suspension include, for example, sodium carboxymethyl cellulose, sorbitol and/or 
dextran. Optionally, the suspension may also contain stabilizers. Liposomes and other 
non-viral vectors can also be used to encapsulate the agent for delivery into the cell. 

The pharmaceutical formulation for systemic administration according to the 
invention may be formulated for enteral, parenteral, or topical (top) administration. 
Indeed, all three types of formulations may be used simultaneously to achieve systemic 
administration of the active ingredient. 

Suitable formulations for oral administration include hard or soft gelatin capsules, 
pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and 
controlled release forms thereof. 

Potentially, any compound which binds Dkk or a Dkk interacting protein or 
modulates the Dkk/LRP5 or Dkk/LRP6 or Dkk/Dkk interacting protein complex may be 
a therapeutic compound. In one embodiment of the invention, a peptide or nucleic acid 
aptamer according to the invention is used in a therapeutic composition. Such 
compositions may comprise an aptamer, or a LRP5 or LRP6 fragment unmodified or 

4 
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modified. In another embodiment, the therapeutic compound comprises a Dkk-1 
interacting protein, or biologically active fragment thereof. 

Nucleic acid aptamers have been used in compositions for example by chemical 
bonding to a carrier molecule such as polyethylene glycol (PEG) which may facilitate 
uptake or stabilize the aptamer. A di-alkylgylcerol moiety attached to an RNA will 
embed the aptamer in liposomes, thus stabilizing the compound. Incorporating 
chemical substitutions (i.e. changing the 2'OH group of ribose to a 2'NH in RNA confers 
ribonuclease resistance) and capping, etc. can prevent breakdown. Several such 
techniques are discussed for RNA aptamers in Brady and Gold (Rev. MoL Biol. 74:3-13 
(2000)). 

Peptide aptamers may by used in therapeutic applications by the introduction of 
an expression vector directing aptamer expression into the affected tissue such as for 
example by retroviral delivery, by encapsulating the DNA in a delivery complex or 
simple by naked DNA injection. Or, the aptamer itself or a synthetic analog may be 
used directly as a drug. Encapsulation in polymers and lipids may assist in delivery. 
The use of peptide aptamers as therapeutic and diagnostic agents is reviewed by 
Hoppe-Syler and Butz (J. MoL Med. 78:426-430 (2000)). 

In another aspect of the invention. The structure of a constrained peptide 
aptamer of the invention may be determined such as by NMR or X-ray crystallography. 
(Cavanagh et al., Protein NMR Spectroscopy: Principles and Practice , Academic Press, 
1996; Drenth, Principles of Protein X-Rav Crystallography . Springer Verlag, 1999) 
Preferably the structure is determined in complex with the target protein. A small 
molecule analog is then designed according to the positions of functional elements of 
the 3D structure of the aptamer. ( Guidebook on Molecular Modeling in Drug Design . 
Cohen, Ed., Academic Press, 1996; Molecular Modeling and Drug Design (Topics in 
Molecular and Structural Biology) . Vinter and Gardner Eds., CRC Press, 1994) Thus 
the present invention provides a method for the design of effective and specific drugs 
which modulate the activity of Dkk, Dkk interacting proteins, Dkk/Dkk interacting protein 
complex and the Dkk/LRP complex. Small molecule mimetics of the peptide aptamers 
of the present invention are encompassed within the scope of the invention. 
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In practicing the methods of this invention, the compounds of this invention may 
be used alone or in combination, or in combination with other therapeutic or diagnostic 
agents. In certain preferred embodiments, the compounds of this invention may be co- 
administered along with other compounds typically prescribed for these conditions 
according to generally accepted medical practice. For example, the compounds of this 
invention can be administered in combination with other therapeutic agents for the 
treatment of bone loss. Bone loss mediating agents include bone resorption inhibitors 
such as bisphosphonates (e.g., alendronic acid, clodronic acid, etidronic acid, 
pamidronic acid, risedronic acid and tiludronic acid), vitamin D and vitamin D analogs, 
cathepsin K inhibitors, hormonal agents (e.g., calcitonin and estrogen), and selective 
estrogen receptor modulators or SERMs (e.g., raloxifene). And bone forming agents 
such as parathyroid hormone (PTH) and bone morphogenetic proteins (BMP). 

Additionally contemplated are combinations of agents which regulate Dkk-1 and 
agents which regulate lipid levels such as HMG-CoA reductase inhibitors {i.e., statins 
such as Mevacor®, Lipitor® and other inhibitors such as Baycol®, Lescol®, Pravachol® 
and Zocor®), bile acid sequestrants (e.g., Colestid® and Welchol®), fibric acid 
derivatives (Atromid-S®, Lopid®, Tricor®), and nicotinic acid. 
[0001] The compounds of this invention can be utilized in vivo, ordinarily in vertebrates 
and preferably in mammals, such as humans, sheep, horses, cattle, pigs, dogs, cats, 
rats and mice, or in vitro. 

14. Transgenic Animals 

Transgenic animal models can be created which conditionally express Dkk 
and/or LRP5 or LRP6 and/or Dkk interacting proteins, such as those shown in Figure 5. 
These animals can be used as research tools for the study of the physiological effects 
of the Dkk-1/Dkk-1 interacting protein interaction and/or the LRP5 / Dkk interaction. 
Alternatively, transgenic animals can be created which express a transgenic form of 
Dkk alone or in addition to a transgenic form of HBM or express Dkk interacting 
proteins alone or in addition to a transgenic form of Dkk. Transgenic animals 
expressing HBM or LRP5 can be crossed with transgenic animals expressing Dkk or 
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Dkk interacting proteins to obtain heterozygote as well as homozygote animals which 
express both desired genes. 

Animal models may be created to directly modulate the Dkk/Dkk interacting 
protein or Dkk/ LRP5 interaction activity in vivo to serve as a research tool for 
determining the efficacy of candidate compounds which modulate the Dkk/Dkk 
interacting protein or LRP5 / Dkk interaction activity in vitro. Animals, such as 
transgenic mice, can be created using the techniques employed to make transgenic 
mice that express for example, human Dkk or a Dkk interacting protein, or knockouts 
(KO), which may be conditional, of the gene encoding mouse Dkk or Dkk interacting 

■ 

protein. Knock-in animals include animals wherein genes have been introduced and 
animals wherein a gene that was previously knocked-out is reintroduced into the 
animal. Other transgenic animals can be created with inducible forms of Dkk or a Dkk 
interacting protein to study the effects of the gene on bone mass development and loss 
as well as lipid level regulation. These animals can also be used to study long term 
effects of Dkk or Dkk interacting protein modulation. Transgenic animals may be 
created to express peptide aptamers, or produce RNA aptamers. The transgenic 
vectors may direct expression in a tissue specific manner by the use of tissue specific 
promoters. In a preferred embodiment, a peptide aptamer fusion protein is expressed 
using a bone specific promoter. Such systems can provide a tissue specific knock-out 
of Dkk or Dkk interacting protein activity. 

General methods for creating transgenic animals are known in the art, and are 
described in, for example, Strategies in Transgenic Animal Science (Glenn M. 
Monastersky and James M. Robl eds., ASM Press; Washington, DC, 1995); 
Transgenic Animal Technology: A Laboratory Handbook (Carl A. Pinkert ed., Academic 
Press 1994); Transgenic Animals (Louis Marie Houdebine, ed., Harwood Academic 
Press, 1997); Overexpression and Knockout of Cytokines in Transgenic Mice (Chaim 
O. Jacob, ed., Academic Press 1994); Microinjection and Transgenesis: Strategies and 
Protocols (Springer Lab Manual) (Angel Cid-Arregui and Alejandro Garcia-Carranca, 
eds., Springer Verlag 1998); and Manipulating the Mouse Embrvo: A Laboratory 
Manual (Brigid Hogan ef a/., eds., Cold Spring Harbor Laboratory Press 1994). 
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15. Peptide and Nucleotide Aptamers and Peptide Aotamer Mimetics 

Another embodiment contemplates the use of peptide and nucleotide aptamer 
technology to screen for agents which interact with Dkk, which block Dkk from 
interacting with LRP5 or LRP6, or which block any other Dkk ligand interaction, or 
5 which interact with Dkk interacting proteins, such as those shown in Figure 5. Peptide 
aptamers are molecules in which a variable peptide domain is displayed from a scaffold 
protein. Thioredoxin A (trxA) is commonly used for a scaffold. The peptide insert 
destroys the catalytic site of trxA. It is recognized that numerous proteins may also be 
used as scaffolding proteins to constrain and/or present a peptide aptamer. Other 
10 scaffold proteins that could display a constrained peptide aptamer could include 

staphylococcal nuclease, the protease inhibitor eglin C, the Streptomyces tendea alpha- 
amylase inhibitor Tendamistat, Sp1, and green fluorescent protein (GFP) (reviewed in 
Hoppe-Seyler et a/., J. Steroid Biochem MoL Biol. 78:105-1 1 (2001 )), and the S1 
nuclease from Staphylococcus or M13 for phage display. Any molecule to which the 

■ 

15 aptamer could be anchored and presented in its bioactive conformation would be 
suitable. 

Aptamers can then specifically bind to a given target protein in vitro and in vivo 
and have the potential to selectively block the function of their target protein. Peptide 
aptamers are selected from randomized expression libraries on the basis of their in vivo 

20 binding capacity to the desired target protein. Briefly, a target protein (e.g., Dkk, a Dkk 
interacting protein, or LRP5/6) is linked to a heterologous DNA binding domain (BD) 
and expressed as bait in a yeast test strain. Concomitantly, a library coding for different 
peptides (e.g., 16-mers) of randomized sequence inserted in a scaffold protein 
sequence, which are linked to a heterologous transcriptional activation domain (AD) is 

25 expressed as prey. If a peptide binds to a target protein, a functional transcription 
factor is reconstituted, in which the BD and AD are bridged together by interacting 
proteins. This transcription factor is then able to activate the promoter of a marker gene 
which can be monitored by colorimetric enzymatic assays or by growth selection. 
Additional variation, methods of preparing and screening methodologies are described 

30 in, for example, Hoppe-Seyler et a/., J. MoL Med. 78: 426-430 (2000). 
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Nucleotide aptamers are described for example in Brody et ai, Trends Moi 
Biotechnol. 74: 5-13 (2000). Additional methods of making and using nucleotide 
aptamers include SELEX, i.e., Systematic Evolution of Ligands by Exponential 
Enrichment. SELEX is a process of isolating oligonucleotide ligands of a chosen target 
molecule (see Tuerk and Gold, Science 249:505-510 (1990); U.S. Pat. Nos. 5,475,096, 
5,595,877, and 5,660,985). SELEX, as described in Tuerk and Gold, involves admixing 
the target molecule with a pool of oligonucleotides (e.g., RNA) of diverse sequences; 
retaining complexes formed between the target and oligonucleotides; recovering the 
oligonucleotides bound to the target; reverse-transcribing the RNA into DNA; amplifying 
the DNA with polymerase chain reactions (PCR); transcribing the amplified DNA into 
RNA; and repeating the cycle with ever increasing binding stringency. Three enzymatic 
reactions are required for each cycle. It usually takes 1 2-1 5 cycles to isolate aptamers 
of high affinity and specificity to the target. An aptamer is an oligonucleotide that is 
capable of binding to an intended target substance but not other molecules under the 
same conditions. 

In another reference, Bock et a/., Nature 355:564-566 (1990), describe a 
different process from the SELEX method of Tuerk and Gold in that only one enzymatic 
reaction is required for each cycle {i.e., PCR) because the nucleic acid library in Bock's 
method is comprised of DNA instead of RNA. The identification and isolation of 
aptamers of high specificity and affinity with the method of Bock et ai still requires 
repeated cycles in a chromatographic column. 

Other nucleotide aptamer methods include those described by Conrad et ai, 
Meth. EnzymoL 267:336-367 (1996). Conrad et ai describe a variety of methods for 
isolating aptamers, all of which employ repeated cycles to enrich target-bound ligands 
and require a large amount of purified target molecules. More recently described 
methods of making and using nucleotide aptamers include, but are not limited to those 
described in U.S. Patent Nos. 6,180,348; 6,051,388; 5,840,867; 5,780,610, 5,756,291 
and 5,582,981. 

Potentially, any compound which binds Dkk or a Dkk interacting protein or 
modulates the Dkk/Dkk interacting protein or Dkk/LRP5 or Dkk/LRP6 complex may be 
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a therapeutic compound. In one embodiment of the invention, a peptide or nucleic acid 
aptamer according to the invention is used in a therapeutic composition. Such 
compositions may comprise an aptamer, or a LRP5 or LRP6 fragment unmodified or 
modified. 

Nucleic acid aptamers have been used in compositions for example by chemical 
bonding to a carrier molecule such as polyethylene glycol (PEG) which may facilitate 
uptake or stabilize the aptamer. A di-alkylglycerol moiety attached to an RNA will 
embed the aptamer in liposomes, thus stabilizing the compound. Incorporating 
chemical substitutions {i.e., changing the 2-OH group of ribose to a 2-NH in RNA 
confers ribonuclease resistance) and capping, etc. can prevent breakdown. Several 
such techniques are discussed for RNA aptamers in Brody and Gold Rev. Mol. Biol. 
74:3-13(2000). 

Peptide aptamers may by used in therapeutic applications by the introduction of 
an expression vector directing aptamer expression into the affected tissue such as for 
example by retroviral delivery, by encapsulating the DNA in a delivery complex or 
simple by naked DNA injection. Or, the aptamer itself or a synthetic analog may be 
used directly as a drug. Encapsulation in polymers and lipids may assist in delivery. 
The use of peptide aptamers as therapeutic and diagnostic agents is reviewed by 
Hoppe-Syler and Butz J. Mol. Med. 78:426-430 (2000). 

In another aspect of the invention, the structure of a constrained peptide 
aptamer of the invention may be determined such as by NMR or X-ray crystallography. 
(Cavanagh et al., Protein NMR Spectroscopy : Principles and Practice . Academic 
Press, 1996; Drenth, Principles of Protein X-Rav Crystallography . Springer Verlag, 
1999) Preferably the structure is determined in complex with the target protein. A 
small molecule analog is then designed according to the positions of functional 
elements of the 3D structure of the aptamer. (Guidebook on Molecular Modeling in 
Drug Design . Cohen, Ed., Academic Press, 1996; Molecular Modeling and Drug Design 
(Topics in Molecular and Structural BioloavV Vinter and Gardner Eds., CRC Press, 
1994) Thus, a method is provided for the design of effective and specific drugs which 
modulate the activity of Dkk, Dkk interacting proteins, Dkk/Dkk interacting protein 
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complex, and the Dkk/LRP complex. Small molecule mimics of the peptide aptamers of 
the present invention are also encompassed within the scope of the invention. 

16. Alternative Variants of LRP5/LRP6 Having HBM Activity 

5 A structural model of the LRP5/Zmax1 first beta-propeller module was generated 

based on a model prediction in Springer et al. f (1998) J. Molecular Biology, 283:837- 
862. Based on the model, certain amino acid residues were identified as important 
variants of LRP5/HBM/Zmax1 . The following three categories provide examples of 
such variants: 

1 0 The shape of the beta-propeller resembles a disk with inward-sloping sides and 

a hole down the middle. Residue 171 is in a loop on the outer or top surface of the 
domain in blade 4 of propeller module 1 . Thus, variants comprising changed residues 
in structurally equivalent positions in other blades; as well as residues that are slightly 
more interior to the binding pocket, but still accessible to the surface, are important 
1 5 embodiments of the present invention for the study of bone mass modulation by 
LRP5/HBM, for the development of pharmaceuticals and treatments of bone mass 
disorders, and for other objectives of the present invention. The following are examples 
of such variants: 

A214V ( a position equivalent to 171 in blade 5; alanine is not 
20 conserved in other propellers), 

E128V (a position equivalent to 171 in blade 3; glutamate is not 
conserved in other propellers), 

A65V (a position equivalent to 171 in blade 2; alanine is conserved in 

propellers 1-3 but not 4), 
25 G199V (an accessible interior position in blade 5; glycine is 

conserved in propellers 1-3 but not 4), and 

M282V (accessible interior position in blade 1 ; methionine is 

conserved in propellers 1-3 but not 4). 
LRP5/Zmax1 has four beta-propeller structures; the first three beta-propeller 
30 modules conserve a glycine in the position corresponding to residue 171 in human 
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LRP5/Zmax1. Therefore, variants bearing a valine in the equivalent positions in the 
other propellers are important embodiments of the present invention. The following 
variants are useful for the study of bone mass modulation by LRP5/HBM, for the 
development of pharmaceuticals and treatments of bone mass disorders, and for other 
5 objectives of the present invention: G479V, G781 V, and Q1 087V. 

The G171V HBM polymorphism results in "occupied space" of the beta-propeller 
1, with the side-chain from the valine residue sticking out into an open binding pocket 
and potentially altering a ligand/protein interaction. The glycine residue is conserved in 
LRP5/Zmax1 propellers 1 , 2 and 3 but is a glutamine in propeller 4. Therefore, the 
1 0 following variants of LRP5/HBM are important embodiments of the present invention for 
the study of bone mass modulation by LRP5/HBM, for the development of 
pharmaceuticals and treatments of bone mass disorders, and for other objectives of the 
present invention: 

G171K (which introduces a charged side-chain), 
15 G171F (which introduces a ringed side-chain), 

G171I (which introduces a branched side-chain), and 
G171Q (which introduces the propeller 4 residue). 
Furthermore, LRP6 is the closest homolog of LRP5/Zmax1 . LRP6 has a beta- 
propeller structure predicted to be similar, if not identical to Zmaxl . The position 
20 corresponding to glycine 171 in human LRP5/Zmax1 is glycine 158 of human LRP6. 
Thus, corresponding variants of LRP6 are an important embodiment of the present 
invention for the study of the specificity of LRP5/Zmax1 versus its related family 
member, for the development of pharmaceuticals and treatments of bone mass 
disorders, and for other objectives of the present invention. Specifically, for example, a 
25 glycine to valine substitution at the structurally equivalent position, residue 1 58, of 

human LRP6 and similar variants of other species' LRP6 homologs represent important 
research tools. 

Site-directed mutants of LRP5 were generated in the full-length human LRP5 
cDNA using the QuikChange XL-Site T Directed Mutagenesis Kit (catalog #200516, 
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Stratagene, La Jolla, CA) following the manufacturer's protocol. The mutant sequences 
were introduced using complementary synthetic oligonucleotides: 

A65V: TGGTCAGCGGCCTGGAGGATGTGGCCGCAGTGGACTTCC (SEQ ID 

NO:129) and 

GGAAGTCCACTGCGGCCACATCCTCCAGGCCGCTGACCA (SEQ ID 
NO:130) 

E128V: AAGCTGTACTGGACGGACTCAGTGACCAACCGCATCGAGG (SEQ 
IDNO:131)and 

CCTCGATGCGGTTGGTCACTGAGTCCGTCCAGTACAGCTT (SEQ ID 
NO:132) 

G1 71 K: ATGTACTGGACAGACTGGAAGGAGACGCCCCGGATTGAGCG 
(SEQ ID NO: 133) and 

CGCTCAATCCGGGGCGTCTCCTTCCAGTCTGTCCAGTACAT (SEQ ID 
NO: 134) 

♦ 

G171F: ATGTACTGGACAGACTGGTTTGAGACGCCCCGGATTGAGCG (SEQ 
ID NO:135) and 

CGCTCAATCCGGGGCGTCTCAAACCAGTCTGTCCAGTACAT (SEQ ID 
NO:136) 

G171I: ATGTACTGGACAGACTGGATTGAGACGCCCCGGATTGAGCG (SEQ 
ID NO:137) and 

CGCTCAATCCGGGGCGTCTCAATCCAGTCTGTCCAGTACAT (SEQ ID 
NO:138) 

G1 71 Q: ATGTACTGGACAGACTGGCAGGAGACGCCCCGGATTGAGCG 
(SEQIDNO:139)and 

CGCTCAATCCGGGGCGTCTCCTGCCAGTCTGTCCAGTACAT (SEQ ID 
NO:140) 

G1 99V: CGGACATTTACTGGCCCAATGTACTGACCATCGACCTGGAGG 
(SEQ ID NO: 141) and 

CCTCCAGGTCGATGGTCAGTACATTGGGCCAGTAAATGTCCG (SEQ ID 
NO:142) 
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A214V: AGCTCTACTGGGCTGACGTCAAGCTCAGCTTCATCCACCG (SEQ 
ID NO: 143) and 

CGGTGGATGAAGCTGAGCTTGACGTCAGCCCAGTAGAGCT (SEQ ID 
NO:144) 

M282V: GAGTGCCCTCTACTCACCCGTGGACATCCAGGTGCTGAGCC (SEQ 
ID NO:145) and 

GGCTCAGCACCTGGATGTCCACGGGTGAGTAGAGGGCACTC (SEQ ID 
NO: 146) 

G479V: CATGTACTGGACAGACTGGGTAGAGAACCCTAAAATCGAGTGTGC 
(SEQ ID NO:147) and 

GCACACTCGATTTTAGGGTTCTCTACCCAGTCTGTCCAGTACATG (SEQ ID 
NO:148) 

G781V: CATCTACTGG ACCG AGTGGGTCG GCAAG CCGAG G ATCGTGCG 
(SEQ ID NO:149) and 

CGCACGATCCTCGGCTTGCCGACCCACTCGGTCCAGTAGATG (SEQ ID 
NO: 150) 

Q1 087V: GTACTTCACCAACATGGTGGACCGGGCAGCCAAGATCGAACG 
(SEQ ID NO:151)and 

CGTTCGATCTTGGCTGCCCGGTCCACCATGTTGGTGAAGTAC (SEQ ID 

NO: 152) 

LRP6G158V: 

GTACTGGACAGACTGGGTAGAAGTGCCAAAGATAGAACGTGC (SEQ ID 
NO: 153) and 

GCACGTTCTATCTTTGGCACTTCTACCCAGTCTGTCCAGTAC (SEQ ID 
NO:154). 

All constructs were sequence verified to ensure that only the engineered 
modification was present in the gene. Once verified, each variant was functionally 
evaluated in the TCF-luciferase assay in U20S cells (essentially as described in 
Example 7. Other functional evaluations could also be performed, such as the Xenopus 
embryo assay (essentially as described in Example 6), or other assays to evaluate Wnt 
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signaling, Dkk modulation, or anabolic bone effect. Binding of these mutants to Dkk, 
LRP-interacting proteins, Dkk-interacting proteins, or peptide aptamers to any of the 
preceding could also be investigated in a variety of ways such as in a two-hybrid 
system (such as in yeast as described in this application), or other methods. 

Figure 24 shows the effects of the G171 F mutation in propeller 1 of LRP5. This 
mutation is at the same position as HBM's G1 71 V substitution. Expression of G1 71 F 
results in an HBM effect. That is, in the presence of Wnt, G1 71 F is able to activate the 
TCF-luciferase reporter construct. In fact, it may activate the reporter to a greater 
extent than either LRP5 or HBM. Furthermore, in the presence of Dkk1 and Wnt1 , 
G171F is less susceptible than LRP5 to modulation by Dkk. These data exemplify that 
the G1 71 F variant modulates Wnt signaling in a manner similar to HBM. In addition, 
this data confirms that HBM's valine residue at 171 is not the only modification at 171 
that can result in an HBM effect. Together these data support an important role for 
LRP5 propeller 1 in modulating Wnt pathway activity; in responding to Dkk modulation; 
and, in the ability to generate an HBM effect. 

Figure 25 shows the effects of the M282V mutation in propeller 1 of LRP5. 
M282 expression results in an HBM-effect. That is, in the presence of Wnt, M282 is 
able to activate the TCF-luciferase reporter construct. Furthermore, in the presence of 
Dkk1 and Wnt1, M282V is less susceptible than LRP5 to modulation by Dkk. These 
data show that the M282V variant modulates Wnt signaling in a manner similar to HBM. 
In addition, this data confirms that modifications of other residues in propeller 1 of LRP5 
can result in an HBM effect. 

These data support an "occupied space" model of the HBM mutation in propeller 
1 and show that multiple mutations of propeller 1 are capable of generating an HBM 
effect; the original G171V HBM mutation is not unique in this ability. Moreover, various 
perturbations in propeller 1 can modulate Dkk activity. 

These data illustrate the molecular mechanism of Dkk modulation of LRP 
signaling. Using the methods disclosed herein and in U.S. Application 60/290,071, 
generation of a comprehensive mutant panel will reveal residues in LRP that function in 
Dkk modulation of Wnt signaling. Such variants of LRP5 and LRP6 that modulate Dkk 
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activity and the residues which distinguish them from LRP5 and LRP6 are points for 
therapeutic intervention by small molecule compound, antibody, peptide aptamer, or 
other agents. Furthermore, models of each HBM-effect mutation/polymorphism may be 
used in rational drug design of an HBM mimetic agent. 
5 These are only a few illustrative examples presented to better describe the 

present invention. Variants of LRP5 which have demonstrated HBM activity in assays 
include G171F, M282V, G171K, G171Q and A214V. Clearly, other variants may be 
contemplated within the scope of the present invention. Furthermore, wherever HBM is 
recited in the methods of the invention, it should be understood that any such 
1 0 alternative variant of LRP which demonstrates HBM biological activity is also 
encompassed by those claims. 

17. Screening Assays 

The two-hybrid system is extremely useful for studying proteimprotein 
15 interactions. See, e.g., Chien et a/., Proc. Natl Acad. ScL USA 88:9578-82 (1991); 
Fields et a/., Trends Genetics 10:286-92 (1994); Harper etal., Cell 75:805-1 6 (1993); 
Vojtek ef a/., Ce//74:205-14 (1993); Luban et a/., Cell 73:1 067-78 (1993); Li etal, 
FASEB J. 7:957-63 (1 993); Zang et a/., Nature 364:308-1 3 (1 993); Golemis et a/., Mol. 
Cell. Biol. 12:3006-14 (1992); Sato etal, Proc. Natl Acad. ScL USA 91:9238-42 (1994); 
20 Coghlan et a/., Science 267:108-111 (1995); Kalpana et a/., Science 266:2002-6 

(1994); Helps et al., FEBS Lett. 340:93-8 (1994); Yeung et a/., Genes & Devel. 8:2087- 
9 (1994); Durfee et a/., Genes & Devel. 7:555-569 (1993); Paetkau et a/., Genes & 
DeveL 8:2035-45; Spaargaren ef a/., 1994 Proc. Natl. Acad. ScL USA 91:12609-13 
(1994); Ye et al., Proc. Natl Acad. ScL USA 91:12629-33 (1994); and U.S. Patent Nos. 
25 5,989,808; 6,251 ,602; and 6,284,519. 

Variations of the system are available for screening yeast phagemid (see, e.g., 
Harper, Cellular Interactions and Development: A Practical Approach . 153-179 (1993); 
Elledge et at., Proc. Natl Acad. ScL USA 88:1731-5 (1991)) or plasmid (Bartel, 1993 
and Bartel, Cell 14:920-4 (1993)); Finley etal., Proc. Natl Acad. ScL USA 91:12980-4 
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(1994)) cDNA libraries to clone interacting proteins, as well as for studying known 
protein pairs. 

The success of the two-hybrid system relies upon the fact that the DNA binding 
and polymerase activation domains of many transcription factors, such as GAL4, can 
5 be separated and then rejoined to restore functionality (Morin et a/., Nuc. Acids Res. 
21 :21 57-63 (1993)). While these examples describe two-hybrid screens in the yeast 
system, it is understood that a two-hybrid screen may be conducted in other systems 
such as mammalian cell lines. The invention is therefore not limited to the use of a 
yeast two-hybrid system, but encompasses such alternative systems. 

1 0 Yeast strains with integrated copies of various reporter gene cassettes, such as 

for example GAL.fwdarw.LacZ, GAL.fwdarw.HIS3 or GAL.fwdarw.URA3 (Bartel, in 
Cellular Interactions and Development: A Practical Approach . 153-179 (1993); Harper 
et a/., Ce//75:805-16 (1993); Fields et aL, Trends Genetics 10:286-92 (1994)) are co- 
transformed with two plasmids, each expressing a different fusion protein. One plasmid 

1 5 encodes a fusion between protein "X" and the DNA binding domain of, for example, the 
GAL4 yeast transcription activator (Brent et a/., Cell 43:729-36 (1985); Ma et a/., Cell 
48:847-53 (1987); Keegan et a/., Science 231:699-704 (1986)), while the other plasmid 
encodes a fusion between protein "Y" and the RNA polymerase activation domain of 
GAL4 (Keegan et a/., 1986). The plasmids are transformed into a strain of the yeast 

20 that contains a reporter gene, such as lacZ, whose regulatory region contains GAL4 
binding sites. If proteins X and Y interact, they reconstitute a functional GAL4 
transcription activator protein by bringing the two GAL4 components into sufficient 
proximity to activate transcription. It is well understood that the role of bait and prey 
proteins may be alternatively switched and thus the embodiments of this invention 

25 contemplate and encompass both alternative arrangements. 

Either hybrid protein alone must be unable to activate transcription of the 
reporter gene, the DNA-binding domain hybrid, because it does not provide an 
activation function, and the activation domain hybrid, because it cannot localize to the 
GAL4 binding sites. Interaction of the two test proteins reconstitutes the function of 

30 GAL4 and results in expression of the reporter gene. The reporter gene cassettes 
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consist of minimal promoters that contain the GAL4 DNA recognition site (Johnson et 
a/., Mol. Cell. Biol. 4:1440-8 (1984); Lorch etaL, J. Mol. Biol. 186:821-824 (1984)) 
cloned 5' to their TATA box. Transcription activation is scored by measuring either the 
expression of (3-galactosidase or the growth of the transformants on minimal medium 
lacking the specific nutrient that permits auxotrophic selection for the transcription 
product, e.g., URA3 (uracil selection) or HIS3 (histidine selection). See, e.g., Bartel, 
1993; Durfee et a/., Genes & Devel. 7:555-569 (1993); Fields et a/., Trends Genet 
10:286-292 (1994); and U.S. Pat. No. 5,283,173. 

Generally, these methods include two proteins to be tested for interaction which 
are expressed as hybrids in the nucleus of a yeast cell. One of the proteins is fused to 
the DNA-binding domain (DBD) of a transcription factor and the other is fused to a 
transcription activation domain (AD). If the proteins interact, they reconstitute a 
functional transcription factor that activates one or more reporter genes that contain 
binding sites for the DBD. Exemplary two-hybrid assays which have been used for 
Dkk-1 or Dkk-1/LRP5 are presented in the Examples below. 

Additional methods of preparing two hybrid assay systems for Dkk-1 interactors 
would be evident to one of ordinary skill in the art. See for example, Finley et a/., "Two- 
Hybrid Analysis of Genetic Regulatory Networks," in The Yeast Two-Hvbrid System 
(Paul L. Bartel et al., eds., Oxford, 1997); Meijia Yang, "Use of a Combinatorial Peptide 
Library in the Two-Hybrid Assay," in The Yeast Two-Hybrid System (Paul L. Bartel et 
al., eds., Oxford, 1997); Gietz et a/., "Identification of proteins that interact with a protein 
of interest: Applications of the yeast two-hybrid system," Mol. & Cell. Biochem. 172:67-9 
(1997); K. H. Young, "Yeast Two-Hybrid: So Many lnteractions,(in) so Little Time," Biol. 
Reprod. 58:302-311 (1998); R. Brent et a/., "Understanding Gene and Allele Function 
with Two-Hybrid Methods," Annu. Rev. Genet. 31 :663-704 (1 997). It will be 
appreciated that protein networks can be elucidated by performing sequential screens 
of activation domain-fusion libraries. 

Without further description, it is believed that one of ordinary skill in the art can, 
using the preceding description and the following illustrative examples, make and utilize 
the compounds of the present invention and practice the claimed methods. The 
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following working examples therefore, specifically point out preferred embodiments of 
the present invention, and are not to be construed as limiting in any way the remainder 
of the disclosure. 

EXAMPLES 

The present invention is described by reference to the following Examples, 
which are offered by way of illustration and are not intended to limit the invention in any 
manner. Standard techniques well-known in the art or the techniques specifically 
described below were utilized. 

For routine practice of the protocols referenced below, one of skill in the art is 
directed to the references cited in this application as well as the several Current 
Protocol guides, which are continuously updated, widely available and published by 
John Wiley and Sons, (New York). In the life sciences, Current Protocols publishes 
comprehensive manuals in Molecular Biology, Immunology, Human Genetics, Protein 
Science, Cytometry, Neuroscience, Pharmacology, Cell Biology, Toxicology, and 
Nucleic Acid Chemistry. Additional sources are known to one of skill in the art. 

Example 1 

Yeast Two Hybrid Screen Using LRP5 Liaand Binding Domain (LBD) Bait Sequences 

In a screen against human osteoblast library (/.e M HOB03C5, a custom Gibco 
generated Y2H compatible cDNA library from a human osteoblast cell line as described 
by Bodine and Komm, Bone 25:535-43 (1999)), an interaction with Dkk-1 was 
identified. The LRP5 ligand binding domain (LBD) baits used for this screen are 
depicted in Figures 2B and C. The basic protocol is as follows: 

An overnight culture of the yeast strain containing the bait of interest is grown in 
20 ml of appropriate selective medium containing 2% glucose at 30°C. The overnight 
culture is diluted by a 10 fold factor into YPDmedia supplemented with 40 mg/l of 
adenine, and grown for 4 hours at 30°C. 

For each mating event, an aliquot of the frozen prey library is grown in 150 ml 
YAPD medium for 5 hours at 30°C. 
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Appropriate volumes calculated by measuring the OD600 of each culture are 
combined into a tube. The number of diploids to be screened is typically ten times the 
number of clones originally present in the prey library of interest. Assuming a mating 
efficiency of 20% minimum, fifty times (i.e., ten times coverage multiplied by 20% 
5 mating efficiency) as many haploid cells containing the bait and as many cells 

containing the prey are used in any given mating event. The mixture is filtered over a 47 
mm, 0.45 mm sterile Metricel filter membrane (Gelman). 

Using sterile forceps, the filter is transferred onto a 100 mm 2 YAPD agar plate 
with the cell side up, removing all air bubbles underneath the filter. The plate is 
1 0 incubated overnight at room temperature. 

The filter is transferred into a 50 ml Falcon tube using sterile forceps and 10 ml 
SD medium containing 2% glucose are added to resuspend the cells. The filter, once 
free of cells, is removed and the cell suspension is spun for 5 min. at 2,000 xg. 

The cells are resuspended in 10 ml SD medium containing 2% glucose. An 
1 5 aliquot of 1 00 /A is set aside for titration. 

The cells are plated onto large square plates containing appropriate selective 
media and incubated at 30 °C for three to five days. 

To calculate the mating efficiency and to determine the total number of diploid 
cells screened, the 100 /A aliquot set aside for titration is diluted and plated onto 
20 different selective media. The mating efficiency is calculated by dividing the number of 
diploids/ml by the lowest number of haploids/ml, either bait or prey, and multiplied by 
100. For example, if 2 million diploids were obtained by mating 10 million of haploids 
containing a bait and 12 million of haploids containing a prey, then the mating efficiency 
is calculated by dividing 2 million by 10 million, which equals 0.2 and multiplied by 100 
25 which equals 20%. Typical mating efficiencies under the above conditions are within 
about 20 to about 40%. The total number of diploids screened in a mating event is 
obtained by multiplying the number of diploids/ml by the total number of ml plated, 
typically about 10. 

30 
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Isolation of colonies containing pairs of interacting proteins. 

Yeast colonies from the interaction selection (large square) plates are picked 
with a sterile toothpick and patched onto plates containing the appropriate selective 
media and incubated at 30°C for two days. 

To further ensure purity of the yeast, the plates are replicated onto another plate 
containing the same media and incubated at 30 °C for another two days. 

Yeast patches are scraped using a sterile toothpick and placed into a 96-well 
format plate containing 100 /x\ SD -L -W -H with 2% glucose liquid medium. 

Half the volume of the plate is transferred to a 96-well plate containing 50 {A of 
40% glycerol for storage. The other half is set aside for replication and galactosidase- 
activity assay (see below). 

Cells are replicated onto a SD -L -W -H plate with 2% glucose plate to create a 
master plate, and incubated two days at 30°C. The master plate is replicated onto 
different selective media to score the strength of each interaction. 

Cells are also replicated onto media selecting for the prey vector only for colony 
PGR and incubated two days at 30°C. 

Galactosidase activity assay 

Ten microliters from the 96-well plate (set aside from above) are transferred into 
another 96-well plate containing 100 SD and 2% glucose media. The cell density is 
measured at OD^ using a spectrophotometer, the OD 600 is usually between 0.03 and 
0.1 . Fifty microliters of Galactosidase reaction mixture (Tropix) are added to 
microplates (Marsh) specifically designed for the luminometer (Hewlett Packard 
Lumicount). Fifty microliters of the diluted cells are then added and mixed by pipetting. 
The reaction is incubated sixty to one hundred twenty minutes at room temperature. 
Relative Light Units (RLUs) are read by the luminometer. Each plate contains a 
negative control, constituted by diploid yeast containing the bait of interest and an 
empty prey vector. To be scored as positive, the diploids tested have to have an RLU 
number at least twice as high as the negative control. 
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Example 2 

Minimum interaction domain mapping 
Further analysis of yeast two hybrid (Y2H) interacting proteins includes the 
dissection of protein motifs responsible for the interaction. Sequence alignment of 
5 multiple clones identified in the Y2H screens can help identify the smallest common 
region responsible for the interaction. In the absence of appropriate clones, deletion 
mapping of interacting domains is necessary. 

PCR primers containing restriction sites suitable for cloning are designed to 
cover multiple sub-domains of the protein of interest (bait or prey). The methods 
10 involved in cloning, sequencing, yeast transformation, mating, and scoring of 

interactions are readily performed by one of ordinary skill in the art of molecular biology 
and genetic engineering. 

Materials and Methods 

1 5 Minimum interaction domain: primers were designed for PCR of the Dkk-1 clone 

isolated by screening a primary osteoblast cell strain (HOB03C5) library with pooled 
Zmax1/LRP5 ligand binding domain (LBD) baits: LBD1 (Leu969-Pro1376) and LBD4 
(Arg1070-Pro1376). The primers, which are presented in 5' to 3* orientation, were as 
follows: 

20 



SEQ ID NO 


Primer 


Sequence 


155 


Forward 1 


I I I I I I GTCGACCAATTCCAACGCTATCAAG 


156 


Forward 2 


I I I I I I GTCGACCTGCGCTAGTCCCACCCGC 


157 


Forward 3 


I I I I I I GTCGACCGTGTCTTCTGATCAAAATC 

i 


158 


Forward 4 


* 

1 1 1 1 1 1 G 1 CGACCGGACAAGAAGG 1 ICIGI 1 IG 


159 


Reverse 1 


rTTTTTGCGGCCGC 1 IAI 1 IGGTGIGAIACAI HUG 


160 


Reverse 2 


TTTTTTGCGGCCGCTTAGCAAGACAGACCTTCTCC 


161 


Reverse 3 


TTTTTTGCGGCCGCTTAGTGTCTCTGACAAGTGTG 
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PCR was performed using PfuTurbo® polymerase (Stratagene). The PCR 
products were gel purified, digested with Sa/1/ Nott and ligated to pPC86 (Gibco/BRL) 
which had been linearized with Sal\/Not\. Clones were recovered and sequenced to 
ascertain that the structure was as expected and that the Gal4 activation domain and 
Dkk-1 were in-frame. The ORF of Dkk-1 was Met1-His266, as in human Dkk-1 
(GenBank Accession No. XM_005730). 

The clones used were as follows: D5 (F1/R3: Asn34-His266), D4 (F1/R2: 
Asn34-Cys245), D3 (F1/R1: Asn34-Lys1 82), D9 (F2/R3: Cys97-His266), D12 (F3/R3, 
val139-His266), D14 (F4/R3: Gly183-His266), D8 (F2/R2: Cys97-Cys245), and D11 
(F3/R2: Val1 39-Cys245). F1 , F2, F3 and F4 refer respectively to Forward primers 1 , 2, 
3 and 4. R1 , R2 and R3 refer respectively to reverse primers 1 , 2 and 3. 

These clones were transformed into yeast and mated with each of three yeast 
strains containing pDBIeu (Gibco/BRL), pDBIeuLBDI , and pDBIeuLBD4. Positive 
interactions were detected by growth of the hybrids on appropriate selective media. 

Results 

Minimum interaction domain: Figure 6 shows that while growth was observed in 
diploids of D4, D5, D8, D9, and D12, no growth was observed in hybrids of D3, D1 1 , 
and D12. Carboxy terminal (C-terminal) deletions indicated that while the C-terminal 
amino acids of Dkk-1 containing the potential N-glycosylation site (Arg246-His266) are 
not required for interaction with Zmax1/LRP5 LBD baits, the Cys2 domain, Gly183- 
Cys245, is required. N-terminal deletions also demonstrated that the region between 
the two cysteine domains, i.e. Val139 to Lys182, is also required. Two minimum 
interaction domain constructs were isolated: D1 2 (Val1 39-His266) and D8 (Cys97- 
Cys245). Similar constructs could be prepared for Dkk-1 interactors. 

Example 3 

Yeast-2 Hybrid screen for peptide aptamer sequences to Dkk-1 
Peptide aptamer library construction 
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A peptide aptamer library, Tpep, was constructed, which provides a means to 
identify chimeric proteins that bind to a protein target (or bait) of interest using classic 
yeast two hybrid (Y2H) assays. The Tpep library is a combinatorial aptamer library 
composed of constrained random peptides, expressed within the context of the disulfide 
loop of E. coli thioredoxin (trxA), and as C-termini fusion to the S. cerevisiae Gal4 
activation domain. The Tpep library was generated using a restriction enzyme modified 
recombinant Y2H prey vector, pPC86 (Gibco), which contains the trxA scaffold protein. 

Generation of aptamer-encoding sequences 

Aptamer-encoding sequences were produced as follows. DNA encoding 
random stretches of approximately sixteen amino acids surrounded by appropriate 
restriction sites were generated by semi-random oligonucleotide synthesis. The 
synthetic oligonucleotides were PCR-amplified, restriction digested, and cloned into the 
permissive sites within the trxA scaffold protein. The cloning strategy was to insert the 
random oligonucleotide sequence is in-frame with the scaffold protein coding sequence, 
resulting in expression of a scaffold protein-aptamer chimera. The scaffold protein is 
itself in-frame with the activation domain of Gal4, within the pPC86 vector that is 
appropriate for the aptamer to be expressed and functional in a regular Y2H assay. 
Additional methods of preparing aptamers would be apparent to the skilled artisan. 

Generation of a permissive recombinant pPC86 vector containing the trxA coding 
sequence 

First the Rsri\ restriction site located within the Gal4 activation domain of pPC86 
(Gibco) was eliminated by site-directed mutagenesis (Quickchange™ kit, Stratagene). 
The amino acid sequence of the Gal4 activation domain was unchanged by this 
modification. The strength of different control interactions was verified to be 
unchanged by the modification. 

Second, the E. coli trxA coding sequence was cloned into the Sail and A/of I sites 
of the Rs/11-modified pPC86. EcoRI and Spel sites were then introduced within the trxA 
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Rsr\\ site. The oligonucleotides encoding the peptide aptamers were cloned into the 
EcoRI and Spel sites of the resulting vector. 

Example 4 

5 Yeast-2 Hybrid screen for Dkk-1 interacting proteins 

A Dkk-1 bait sequence was utilized in a yeast two hybrid screen to identify Dkk-1 
interacting proteins. The procedure for the Y2H was carried out similarly to that 
employed in Example 1 , except that the Dkk-1 bait from Figure 2C was used instead of 
10 LRP baits. The screen was performed using Hela and fetal brain libraries (Invitrogen 
Corporation, Carlsbad, CA). Multiple libraries were used to identify additional Dkk-1 
interacting proteins and to confirm interactions found in other libraries. 

The list of Dkk-1 interacting proteins uncovered in these Y2H screens are listed 
in Figure 5. 

15 The interacting proteins identified in the Dkk-1 bait screen can be used in other 

Y2H screens with LRP baits and other Dkk-1 interacting proteins to determine more 
complex interactions which may modulate Dkk-1/LRP interactions and/or Wnt signaling. 

Example 5 

20 Generation of antibodies 

In each of the following antibody-generating examples, the synthesis of these 
linear peptides is followed by injection into two New Zealand Rabbits. Subsequent 
boosts and bleeds are taken according to a standard ten-week protocol. The end-user 
25 receives back 5 mgs of peptide, aliquots of pre-bleeds, roughly 80 ml of crude sera 
from each of the two rabbits and, and ELISA titration data is obtained. 

Generation ofLRP5 Polymorphism-specific antibodies 

Antibodies were generated to the following peptides to obtain antibodies which 
30 distinguish the HBM polymorphism versus wild-type LRP5/Zmax: MYWTDWVETPRIE 



117 



WO 02/092015 



PCTAJS02/15982 



(SEQ ID NO:123) (mutant peptide) and MYWTDWGETPRIE (SEQ ID NO:124) (wild- 
type peptide for negative selection). Immunofluorescence data confirmed that the 
antibody, after affinity purification, is specific for HBM and does not recognize LRP5 
(Figure 1 7). 

Generation ofLRP5 Monospecific antibodies 

LRP5 monospecific polyclonal antibodies were generated to the following amino 
acid sequences of LRP5: Peptide 1 (a.a. 265-277) - KRTGGKRKEILSA (SEQ ID 
NO:125), Peptide 2 (a.a. 1178-1194) - ERVEKTTGDKRTRIQGR (SEQ ID NO:126), 
and Peptide 3 (a.a. 1352-1375) - KQQCDSFPDCIDGSDE (SEQ ID NO:127). 
Immunofluorescence confirmed that the antibody generated detects LRP5. 

Generation ofDkk-1 monospecific polyclonal antibodies 

Dkk-1 monospecific polyclonal antibodies were generated to the following amino 
acid sequences of Dkk-1: Peptide 1 (a.a. 71-85) - GNKYQTIDNYQPYPC (SEQ ID 
NO:118), Peptide 2 (a.a. 165-186) - LDGYSRRTTLSSKMYHTKGQEG (SEQ ID 
NO:119 ),. Peptide 3 (a.a. 246-266) - RIQKDHHQASNSSRLHTCQRH (SEQ ID 
NO: 120), Peptide 4 (a.a. 147-161) - RGEIEETITESFGND (SEQ ID NO: 121), and 
Peptide 5 (232-250) - E I FQ RCYCG EG LSCRIQ KD (SEQ ID NO:122) of human Dkk-1 . 
Figure 26 shows the location of the various peptides selected, their relationship to the 
Dkk-1 amino acid sequence and polyclonal antibodies generated. 

Western blots demonstrated that the antibodies generated against peptides 2 
(Antibody #5521) (Figure 27) and 4 (Antibody #74397) (Figure 28) are specific toward 
Dkk-1 . Figure 27 shows Western blots using 500 fA of conditioned medium (CM) from " 
non-transfected 293 cells or from 293 cells transfected with Dkk1 -V5 that were 
immunoprecipitated by anti-V5 antibody. Bead elutes were separated by non-reducing 
SDS-PAGE (lanes #4, 5 of Figure 27). 20 (A of conditioned medium from both samples 
(lanes #2, 3 of Figure 27) and from Dkk1-AP transfected 293 cells (lane #6 of Figure 
27) were additionally separated on the gel. The Western was performed using 
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antibodies Anti-V5/AP (1:10,000) and Ab#5521 (10^g/ml). Ab#5521 detected Dkk1-V5 
and Dkk1-AP from conditioned medium. 

Figure 28 shows Western blot results using Ab#74397. Anti-V5/AP was tested 
at a 1 :4000 dilution and Ab#74397 was tested at a 1:500 dilution. Ab#74397 was able 
to detect Dkk1-V5 in both conditioned medium and immunoprecipitated conditioned 
medium. 



The results obtained with antibodies #5521 and #74397 are summarized in the 
following table: 



Rabbit 
No. 


Peptide 
Position 


Peptide 
Sequence 


Purified 
(Y/N) 


Western 


Immuno- 
precipitation 


Location 


5521 


165-186 


LDGYSR 
RTTLSSK 
MYHTKG 
QEG 


Y (Protein 
G 

purified) 


Y 


N/A 


Between 
Cy1 and 
Cys2 
domain 


74397 


147-161 


RGEIEETI 

TESFGN 

D 


N 


Y 


N/A 


Between 
Cy1 and 
Cys2 
domain 



Example 6 

Effects of exogenous Dkk-1 on Wnt-mediated signaling in the Xenopus embryo assay 

Xenopus embryos are an informative and well-established in vivo assay system 
to evaluate the modulation of Wnt signaling (McMahon et a/., Cell 58: 1075-84 (1989); 
Smith and Harland, 1991; reviewed in Wodarz and Nusse 1998) . 

Modification of the Wnt signaling pathway can be visualized by examining the 
embryos for a dorsalization phenotype (duplicated body axis) after RNA injection into 
the ventral blastomere at the 4- or 8-cell stage. On the molecular level, phenotypes can 
be analyzed by looking for expression of various marker genes in stage 10.5 embryos. 
Such markers would include general endoderm, mesoderm, and ectoderm markers as 
well as a variety of tissue-specific transcripts. 
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Analysis can be done by RT-PCR/TaqMan® and can be done on whole embryo 
tissue or in a more restricted fashion (microdissection). Because this system is very 
flexible and rapid, by injecting combinations of transcripts, such as HBM and different 
Wnts or Wnt antagonists, the mechanism of HBM in the Wnt pathwaycan thereby be 
dissected. Furthermore, investigations are conducted to determine whether 
Zmax/LRP5 and HBM differentially modulate Wnt signaling either alone, or in 
combination with other components. Previous studies have demonstrated that LRP6 
alone or LRP5 + Wnt5a were able to induce axis duplication (dorsalization) in this 
system (Tamai et a/., Nature 407: 530-35 (2000)). 

Constructs forXenopus Expression (Vector pCS2*) 

Constructs were prepared using the vector pCS2*. DNA inserts were subcloned 
in the sense orientation with respect to the vector SP6 promoter. The pCS2* vector 
contains an SV40 virus polyadenylation signal and T3 promoter sequence (for 
generation of antisense mRNA) downstream of the insert. 

Full length Zmax/LRP5 and HBM ORF cDNA : Insert cDNA was isolated from 
the full length cDNA retrovirus constructs (with optimized Kozak sequences) by Bg/ll- 
EcoRI digestion and subcloned into the BamHI-EcoRI sites of the pCS2 + vector. 

Full length XWnt8: This cDNA was PCR amplified from a Xenopus embryo 
cDNA library using oligos 1 14484 (SEQ ID NO: 162) (5'- 

CAGTGAATTCACCATGCAAAACACCACTTTGTTC-3 , ) and 114487 (SEQ ID NO: 163) 
(5 , -CAGTTGCGGCCGCTCATCTCCGGTGGCCTCTG-3 , ). The oligos were designed 
to amplify the ORF with a consensus Kozak sequence at the 5' end as determined from 
GenBank #X57234. PCR was carried out using the following conditions: 96°C, 45 sec; 
63°C, 45 sec; 72°C, 2 min. for 30 cycles. The resulting PCR product was purified, 
subcloned into pCRII-TOPO (Invitrogen Corp.), sequence verified, and digested with 
BamH\/Xho\. This insert was subcloned into the vector at the BamH\-Xho\ sites. 

Full length Wnt5a: A murine Wnt5a cDNA clone was purchased from Upstate 
Biotechnology (Lake Placid, NY) and subcloned into the EcoR\ site of the vector. 
Sequencing confirmed insert orientation. 



120 



WO 02/092015 



PCT/US02/15982 



Full length human Dkk-1 : A human cDNA with GenBank accession number 
AF1 27563 was available in the public database. Using this sequence, PCR primers 
were designed to amplify the open reading frame with a consensus Kozak sequence 
immediately upstream of the initiating ATG. Oligos 1 17162 (SEQ ID NO:164) (5'- 
5 CAATAGTCGACGAATTCACCATGGCTCTGGGCGCAGCGG-3 , ) and 1 1 71 63 (SEQ 
ID NO:165) (5 l -GTATTGCGGCCGCTCTAGATTAGTGTCTCTGACAAGTGTGAA-3 , ) 
were used to screen a human uterus cDNA library by PCR. The resulting PCR product 
was purified, subcloned into pCRII-TOPO (Invitrogen Corp.), sequence verified, and 
digested with EcoRl/Xhol This insert was subcloned into the pCS2 + vector at the 

10 EcoR\-Xho\ sites. 

Full length human Dkk-2: A full length cDNA encoding human Dkk-2 was 
isolated to investigate the specificity of the Zmax/LRP5/HBM interaction with the Dkk 
family of molecules. Dkk-1 was identified in yeast as a potential binding partner of 
Zmax/LRP5/HBM. Dkk-1 has also been shown in the literature to be an antagonist of 

15 the Wnt signaling pathway, while Dkk-2 is not (Krupnik et a/., 1999). The Dkk-2 full 
length cDNA serves as a tool to discriminate the specificity and biological significance 
of Zmax/LRP5/HBM interactions with the Dkk family (e.g., Dkk-1, Dkk-2, Dkk-3, Dkk-4, 
Soggy, their homologs and variant, etc.). A human cDNA sequence for Dkk-2 
(GenBank Accession No. NMJ31 4421) was available in the public database. Using this 

20 sequence, PCR primers were designed to amplify the open reading frame with a 

consensus Kozak sequence immediately upstream of the initiating ATG. Oligos 51409 
(SEQ ID NO:166) (5'- CTAACGGATCCACCATGGCCGCGTTGATGCGG-3 , ) and 
51411 (SEQ ID NO: 167) (5-G ATTCG AATTCTC AAATTTTCTG AC AC AC ATG G-3' ) 
were used to screen human embryo and brain cDNA libraries by PCR. The resulting 

25 PCR product was purified, subcloned into pCRII-TOPO, sequence verified, and 

digested with SamHI/EcoRI. This insert was subcloned into the pCS2 + vector at the 
BamH\-EcoR\ sites. 

Full length LRP6 was isolated from the pED6dpc4 vector by Xho\-Xba\ digestion. 
The full length cDNA was reassembled into the Xho\~Xba\ sites of pCS2 + . Insert 
30 orientation was confirmed by DNA sequencing. 
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/ 



mRNA Synthesis and Microinjection Protocol 

mRNA for microinjection into Xenopus embryos is generated by in vitro 
transcription using the cDNA constructs in the pCS2* vector described above as 
template. RNA is synthesized using the Ambion mMessage mMachine high yield 
5 capped RNA transcription kit (Cat. #1340) following the manufacturer's specifications 
for the Sp6 polymerase reactions. RNA products were brought up to a final volume of 
50 pi in sterile, glass-distilled water and purified over Quick Spin Columns for 
Radiolabeled RNA Purification G50-Sephadex (Roche, Cat. #127401 5) following the 
manufacturer's specifications. The resulting eluate was finally extracted with 

10 phenol:chloroform:isoamyl alcohol and isopropanol precipitated using standard 

protocols (Sambrook et aA, 1989). Final RNA volumes were approximately 50 pi. RNA 
concentration was determined by absorbance values at 260 nm and 280 nm. RNA 
integrity was visualized by ethidium bromide staining of denaturing (formaldehyde) 
agarose gel electrophoresis (Sambrook et al., 1989). Various amounts of RNA (2 pg to 

15 1 ng) are injected into the ventral blastomere of the 4- or 8-cell Xenopus embryo. 

These protocols are described in Moon etal., Technique-J. of Methods in Cell and Mol. 
Biol. 1: 76-89 (1989), and Peng, Meth. Cell. Biol. 36: 657-62 (1991). 

Screening for Duplicated Body Axis 

20 In vitro transcribed RNA is purified and injected into a ventral blasomere of the 4- 

or 8-cell Xenopus embryo (approx. 2 hours post-fertilization). At stage 10.5 (approx. 1 1 
hours post-fertilization), the injected embryos are cultured for a total of 72 hours and 
then screened for the presence of a duplicated body axis (dorsalization) (Figure 7). 
Using XWnt8-injected (2-10 pg) as a positive control (Christian et al. (1991)) and water- 

25 injected or non-injected embryos as negative controls, we replicated the published 

observation that Zmax(LRP5) + Wnt5a (500 and 20 pg, respectively) could induce axis 
duplication. Wnt5a (20 pg) alone could not induce axis duplication (as previously 
reported by Moon et al. (1993)). We have also injected GFP RNA (100-770 pg) as a 
negative control to show that the amount of RNA injected is not perturbing embryo 

30 development (not shown). Strikingly, HBM + Wnt5a (500 and 20 pg, respectively) 
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yielded an approximately 3.5 fold more robust response of the phenotype (p=0.043 by 
Fisher's exact test) compared to Zmax(LRPS) + Wnt5a, suggesting that the HBM 
mutation is activating the Wnt pathway (Figures 8 and 9). The HBM/Wnt5a embryos 
also appear to be more "anteriorized" than the Zmax(LRP5)/Wnt5a embryos, again 
suggestive of a gain-of-f unction mutation. 

The role of Dkk-1 as a modulator of Zmax/LRP5- and HBM-mediated Wnt 
signaling was investigated. Literature reports have previously characterized Xenopus 
and murine Dkk-1 as antagonists of the canonical Wnt pathway in the Xenopus system 
(Glinka et a/. f Nature 391:357-362 (1998)). Using the human Dkk-1 construct, a dose- 
response assay was performed to confirm that our construct was functional and to 
identify the optimal amount of RNA for microinjection. Using 250 pg/embryo of hDkk-1 
RNA, over 90% (p<0.001) of the embryos were observed to display enlarged anterior 
structures (big heads) as anticipated from the published reports (Figure 10). 

The mechanism of hDkk-1 modulation of Wnt signaling in the presence of 
Zmax/LRP5 or HBM was also investigated. Without any hDkk-1 present, it was 
confirmed that HBM + Wnt5a was a more potent activator of Wnt signaling than 
Zmax/LRP5 + Wnt5a (p<0.05). Interestingly, in the presence of hDkk-1 (250 pg), 
Zmax/LRP5-mediated Wnt signaling was repressed (p<0.05) but hDkk-1 was unable to 
repress HBM-mediated Wnt signaling (p<0.01) (Figure 11). The specificity of this 
observation can be further addressed by investigating other members of the Dkk family, 
other Wnt genes, LRP6, additional Zmax/LRP5 mutants, and the peptide aptamers. 

Example 7 

Effects of exogenous Dkk and LRP5 on Wnt signaling in the TCF-luciferase Assay 

Wnt activity can be antagonized by many proteins including secreted Frizzled 
related proteins (SFRPs), Cerberus, Wnt Inhibitory Factor-1 and Dkk-1 (Krupnik era/., 
1999). The Dkk family of proteins consists of Dkk-1 -4 and Soggy, a Dkk-3-like protein. 
Dkk-1 and Dkk-4 have been shown to antagonize Wnt mediated Xenopus embryo 
development, whereas Dkk-2, Dkk-3, and Soggy do not. Unlike many of these proteins 
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that antagonize Wnt activity by directly interacting with Wnt proteins, Dkk-1 acts by 
binding to two recently identified Wnt coreceptors. LRP5 and LRP6. (Mao et a/., 2001 ; 
Bafico et a/., 2001). The details of this interaction have been examined by the present 
inventors and Mao et al. using deletion constructs of LRP6, which demonstrated that 
5 EGF repeats 3 and 4 are important for Dkk-1 interaction. Accordingly, the activity of 
two Dkk proteins, Dkk-1 and Dkk-2, were investigated with various Wnt members, 
LRP5, LRP6, and the mutant form of LRP5, designated HBM. The present invention 
explores whether there is any functional difference between LRP5 and HBM with regard 
to Dkk action on Wnt mediated signaling. Various reagents were developed, including 
10 Dkk-1 peptides, constrained LRP5 peptide aptamers, constrained Dkk-1 peptide 

aptamers and polyclonal antibodies to Dkk-1 (in Example 5 above) to identify factors 
that mimic HBM mediated Wnt signaling. 

Methods 

15 Various LRP5 constrained peptides were developed. Specifically, four peptides 

that interact with the LBD of LRP5 (Figure 4,constructs OST259-262 in Figure 12) and 
three peptides that interact with the cytoplasmic domain of LRP5 (constructs OST266- 
OST268 in Figure 12). In addition two Dkk-1 peptides were developed: constructs 
OST264 and OST265 in Figure 12, corresponding to Dkk-1 amino acids 139-266 and 

20 96-245, containing the smallest region of Dkk-1 that interacts with LRP5 (Figure 6). 

The cDNA clones encoding the LRP5 LBD interacting peptides and the Dkk-1 peptides 
were subcloned into pcDNA3.1 with the addition of a Kozak and signal sequence to 
target the peptide for secretion. The constructs encoding the three peptides interacting 
with the cytoplasmic domain of LRP5 were also subcloned into pcDNA3.1. However, 

25 these latter constructs do not contain a signal sequence. 

HOB-03-CE6 osteoblastic cells developed by Wyeth Ayerst (Philadelphia, PA) 
were seeded into 24-well plates at 150,000 cells per well in 1 ml of the growth media 
(D-MEM/F12 phenol red-free) containing 10% (v/v) heat-inactivated FBS, 1X penicillin 
streptomycin, and 1X Glutamax-1, and incubated overnight at 34°C. The following day, 

30 the cells were transfected using Lipofectamine 2000® (as described by the 
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manufacturer, Invitrogen) in OptiMEM (Invitrogen) with 0.35 /xg /well of LRP5, HBM, or 
control plasmid DNA (empty vector pcDNA3.1) and either Wnt1 or Wnt3a plasmid 
DNA. Similar experiments were performed with LRP6 plasmid DNA (0.35yug/well) or a 
control pEDdpc4 empty vector. Furthermore, each of these groups were then divided 
into three groups, those receiving 0.35 jug/well Dkk-1, Dkk-2, or pcDNA3.1 control DNA. 
All wells were transfected with 0.025 //g/well of CMV beta-galactosidase plasmid DNA 
and 0.35 /4j/well 16X TCF(AS)-luciferase reporter DNA (developed by Ramesh Bhat, 
Wyeth-Ayerst (Philadelphia, PA)). After 4 hours of incubation, the cells were rinsed and 
1 ml of fresh growth media was added to each well. The cells were cultured overnight 
at 34 °C, followed by a wash and a change of media. Cells were cultured for an 
additional 18-24 hours at 37°C. Cells were then lysed with 50 /J/well of 1X lysis buffer. 
The extracts were assayed for beta-galactosidase activity (Galacto Reaction Buffer 
Diluent & Light Emission Accelerator, Tropix) using 5 iA extract + 50 fA beta- 
galactosidase diluent and luciferase activity (Luciferase Assay Reagent, Promega) 
using 20 jA extract. 

U20S human osteosarcoma cells were also utilized. U20S cells (ATCC) were 
seeded into 96-well plates at 30,000 cells per well in 200ul of the growth media 
(McCoy's 5A) containing 10% (v/v) heat-inactivated FBS, 1X penicillin streptomycin, 
and 1X Glutamax-1, and incubated overnight at 37°C. The following day, the nmedia 
was replaced with OptiMEM (Invitroge) and cells were transfected using Lipofectamine 
2000® (as described by the manufacturer, Invitrogen) with 0.005jjg/well of LRP5, HBM, 
LRP6 or contol plasmid DNA (empty vector pcDNA3.1) and either Wnt1 (.0025ug/well) 
or Wnt3a (.0025ug/well) plasmid DNA. In addition, the 16x-(AS) TCF-TK-firefly- 
luciferase (Ramesh Bhat, WHRI, Wyeth) and control TK-renilla luciferase (Promega 
Corp.) were co-transfected at 0.3ug/well and 0.06ug/well respectively in all 
experiments. Futhermore, each of these groups was then divided into different groups, 
those receiving 0.05ug/well Dkk-1, Dkk-2, Dkk3, Dkk1 -Alkaline Phosphatase (AP), 
mutant Dkk-1 (C220A), Soggy or pcDNA3.1 control DNA. In other experiments, cells 
were co-transfected with 0.005 pg/well of LRP5, 0.0025ug/well of Wnt1 or Wnt3a (using 
0.0025 pg/well of a control pcDNA3.1) with LRP5-interacting aptamers (0.05ug/well). 
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Cells were cultured for an additional 18-20 hours at 37°C. Culture medium was 
removed. Cells were cultured for an additional 18-20 hours at 37°C. Culture medium 
was removed. Cells were then lysed with 100 /zl/well of 1X Passive Lysis Buffer (PLB) 
of Dual Luciferase Reagent kit (DLR-kit-Promega Corp.) 20 /A of the lysates were 
5 combined with LARII reagent of DLR-kit and assayed for TCF-firefly luciferase signal in 
Top Count (Packard) instrument. After measuring the Firefly readings, 100 fx\ of the 
"Stop and Glo" reagent of DLR kit that contains a quencher and a substrate for renilla 
luciferase was added into each well. Immediately the renilla luciferase reading was 
measured using the Top Count (Packard) Instrument. The ratios of the TCF-firefly 
10 luciferase to control renilla readings were calculated for each well and the mean ratio of 
triplicate or more wells was expressed in all data. 

Results 

The results of these experiments demonstrate that Dkk-1, in the presence of 

15 Wnt1 and LRP5, significantly antagonized TCF-luciferase activity (Figure 14). In 

marked contrast, Dkk-1 had no effect on HBM/Wntl mediated TCF-luciferase activity 
(Figure 14). In similar experiments, Dkk-1 was also able to antagonize LRP5/Wnt3a 
but not HBM/Wnt3a mediated TCF-luciferase activity (Figure 15). These results 
indicate that the HBM mutation renders Dkk-1 inactive as an antagonist of Wnt1 and 

20 Wnt3a signaling in HOB03CE6 osteoblastic cells. In other experiments with Wnt1 , Dkk- 
1 had no effect on LRP5 or HBM mediated TCF-luciferase activity (Figure 14). In 
contrast, with either LRP5 or HBM in the presence of Wnt3a, Dkk-2 was able to 
antagonize the TCF-luciferase activity (Figure 15). These latter results indicate that the 
HBM mutation has no effect on Dkk-2 action in the presence of Wnt3a. Experiments 

25 were also performed using the closely related LRP6 cDNA in HOB-03-CE6 cells. In 
these experiments, LRP6/Wnt1 and LRP6/Wnt3a mediated TCF-luciferase were 
regulated in the same manner as LRP5. Specifically, Dkk-1 antagonized LRP6/Wnt1 
mediated TCF-luciferase activity, whereas Dkk-2 had no effect (Figure 14). However, 
similar to the action of Dkk-2 with LRP5/Wnt3a, Dkk-2 was able to antagonize 

30 LRP6/Wnt3a mediated TCF-luciferase activity (Figure 1 5). 
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The results in the U20S cells show a robust effect of the OST262 LRP5 peptide 
aptamer activation of Wnt signaling in the presence of Wnt3a (Figure 16). These 
functional results are confirmed by the results shown below in Example 1 1 using LRP5 
peptide aptamers in the Xenopus assay. Such results affirmatively demonstrate that 
5 the effects of small molecules on LRP5/LRP6/HBM signaling can be detected using the 
TCF-luciferase assay. 

These data demonstrate that there is a functional difference between LRP5 and 
HBM regarding the ability of Dkk-1 to antagonize Wnt1 and Wnt3a signaling. These 
data and previous data showing that Dkk-1 directly interacts with LRP5 suggests that 
1 0 the inability of Dkk-1 to antagonize HBM/Wnt signaling may in part contribute to the 
HBM phenotype. These experiments further demonstrate the ability to test various 
molecules (e.g., small molecules, aptamers, peptides, antibodies, LRP5 interacting 
proteins or Dkk-1 interacting proteins, and the like) for a LRP5 ligand that mimics HBM 
mediated Wnt signaling or factors that block Dkk-1 interaction with LRP5. 

15 

Example 8 

Yeast-2 Hybrid Interaction Trap 
Small molecule inhibitors (or partial inhibitors) of the Dkk-LRP interaction may be 
an excellent osteogenic therapeutic. One way to investigate this important protein- 

20 protein interaction is using Y2H techniques substantially as described above and as is 
well known in the art. Regions of LRP5, such as LRP5 LBD, have been found to 
functionally interact with Dkk. This interaction is quantitated using a reporter element 
known in the art, e.g., LacZ or luciferase, which is only activated when bait and prey 
interact. The Y2H assay is used to screen for compounds which modulate the LRP- 

25 Dkk interaction. Such a modulation would be visualized by a reduction in reporter 

element activation signifying a weaker or disrupted interaction, or by an enhancement 
of the reporter element activation signifying a stronger interaction. Thus, the Y2H assay 
can be used as a high-throughput screening technique to identify compounds which 
disrupt or enhance Dkk interaction with LRP5/LRP6/HBM, which may serve as potential 

30 therapeutics. 
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For example, the Interaction Trap methodology can be used as follows. The 
LRP5 LBD, for example, was fused with LexA and Dkk-1 was fused with either Gal4-AD 
or B42. With the LRP5LBD-LexA bait and the Gal4AD-Dkk prey, over a 20-fold 
activation of a lacZ reporter (under the control of a single LexA operator) was detected 
over the background. Using a Dkk-1 mutant (C220A) that is unable to bind to LRP, the 
interaction was reduced in yeast, showing the specificity of this interaction and system 
(Figure 18). As a result, small molecules may be identified that modulate this 
interaction between LRP and Dkk. 

Example 9 

Cell-Based Functional High-Throughput Assay 
To develop a high throughput assay, the TCF-luciferase assay described in 
Example 7 was modified utilizing low level expression of endogenous LRP5/6 in U20S 
and HEK293 cells. However, HOB-03-CE6 cells and any other cells which show a 
differential response to Dkk depending on whether LRP5, LRP6 or HBM are 
expressed. Using U20S (human osteosarcoma) and HEK293 (ATCC) cells, the TCF- 
luciferase and tk-Renilla reporter element constructs were co-transfected along with 
Wnt3a/1 and Dkk. Wnt3a alone, by using endogenous LRP5/6, was able to stimulate 
TCF reporter gene activation. When Dkk, is co-transfected with Wnt3a/Wnt 1 and 
reporters (TCF-luci and tk-Renilla), Dkk represses reporter element activity. In addition, 
the TCF-luci signal is activated by Wnt3a/Wnt1 can be repressed by the addition of 
Dkk-enriched conditioned media to the cells containing Wnt3a/Wnt1 and reporters. The 
assay is further validated by the lack of TCF-reporter inhibition by a point mutant 
construct (C220A) of Dkk1 . 

The Dkk-mediated repression of the reporter is dependent upon the 
concentration of transfected Dkk cDNA or on the amount of Dkk-conditioned media 
added. In addition, the Dkk-mediated reporter suppression can be altered by the co- 
transfection of LRP5, LRP6, and HBM cDNAs in the U20S or HEK293 cells. In 
general, U20S cells show greater sensitivity to Dkk-mediated reporter suppression than 
that in HEK-293 cells. In U20S cells, the transfection of LRP5/LRP6/HBM cDNA leads 
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to moderate activation of TCF-luci in the absence of Wnt3a/Wnt1 transfection. This 
activation presumably utilizes the endogenous Wnts present in U20S cells. Under this 
condition, Dkk1 can repress TCF-luci and shows a differential signal between LRP5 
and HBM. By co-transfecting Wnt3a/Wnt1 , there is a generalized increase in the TCF- 
luci signal in the assay. Further, one can detect Dkk-mediated differential repression of 
the reporter due to LRP5 and HBM cDNA expression as well as between LRP5 and 
LRP6 cDNA. The repression is maximal with LRP6, moderate with LRP5, and least 
with HBM cDNA expression. In addition, the assay can detect the functional impact of 
the LRP5 interacting peptide aptamers (Figure 4), Dkk1 interacting aptamers and 
binding domains of Dkk-1 (Figure 6; OST264 and OST265 of Figures 12 and 13). 

Using this system with a suppressed Wnt-TCF signal due to the presence of 
both Dkk and Wnt3a, one can screen for compounds that could alter Dkk modulation of 
Wnt signaling, by looking for compounds that activate or the TCF-luciferase reporter, 
and thereby relieve the Dkk-mediated repression of the Wnt pathway. Such 
compounds identified may potentially serve as HBM-mimetics and be useful, for 
example, as osteogenic therapeutics. Data generated from this high throughput screen 
are demonstrated in Figures 19-21. Figure 19 shows that Dkk1 represses Wnt3a- 
mediated signaling in U20S bone cells. Figure 20 demonstrates the functional 
differences between LRP5, LRP6, and HBM. Dkk-1 represses LRP6 and LRP5 but has 
little or no effect on HBM-generated Wnt1 signaling in U20S cells. Figure 21 
demonstrates the differential effects of various Dkk family members and modified Dkks, 
including Dkk-1, a mutated Dkk-1 (C220A), Dkk-1 -AP (modified with alkaline 
phosphatase), Dkk-3, and Soggy. 

* 

Example 10 

. DKK/LRP5/6/HBM ELISA Assay 
A further method to investigate Dkk binding to LRP is via ELISA assay. Two 
possible permutations of this assay are exemplified. LRP5 is immobilized to a solid 
surface, such as a tissue culture plate well. One skilled in the art will recognize that 
other supports such as a nylon or nitrocellulose membrane, a silicon chip, a glass slide, 
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beads, etc. can be utilized. In this example, the form of LRP5 used is actually a fusion 
protein where the extracellular domain of LRP5 is fused to the Fc portion of human IgG. 
The LRP5-Fc fusion protein is produced in CHO cell extracts from stable cell lines. The 
LRP5-Fc fusion protein is immobilized on the solid surface via anti-human Fc antibody 
or by Protein-A or Protein G-coated plates, for example. The plate is then washed to 
remove any non-bound protein. Conditioned media containing secreted Dkk protein or 
secreted Dkk-epitope tagged protein (or purified Dkk or purified Dkk-epitope tagged 
protein) is incubated in the wells and binding of Dkk to LRP is investigated using 
antibodies to either Dkk or to an epitope tag. Dkk-V5 epitope tagged protein would be 
detected using an alkaline phosphatase tagged anti-V5 antibody. 

Alternatively, the Dkk protein could be directly fused to a detection marker, such 
as alkaline phosphatase. Here the detection of the Dkk-LRP interaction can be directly 
investigated without subsequent antibody-based experiments. The bound Dkk is 
detected in an alkaline phosphatase assay. If the Dkk-alkaline phosphatase fusion 
protein is bound to the immobilized LRP5, alkaline phosphatase activity would be 
detected in a colorimetric readout. As a result, one can assay the ability of small 
molecule compounds to alter the binding of Dkk to LRP lising this system. 
Compounds, when added with Dkk (or epitope-tagged Dkk) to each well of the plate, 
can be scored for their ability to modulate the interaction between Dkk and LRP based 
on the signal intensity of bound Dkk present in the well after a suitable incubation time 
and washing. The assay can be calibrated by doing cold competition experiments with 
unlabeled Dkk or with a second type of epitope-tagged Dkk. Any small molecule that is 
able to modulate the Dkk-LRP interaction may be a suitable therapeutic candidate, 
more preferably an osteogenic therapeutic candidate. 

Example 11 

Functional Evaluation of Peptide Aptamers in Xenopus 
The constrained peptide aptamers constructs OST258-263 (where 258 contains 
the signal sequence by itself and 263 contains an irrelevant constrained peptide) 
(Figures 12 and 13) were used to generate RNA substantially as described in Example 
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7, except the vector was linearized by restriction endonuclease digestion and RNA was 
generated using T7 RNA polymerase. 

Aptamer RNA was injected at 250 pg per blastomere using the protocol of 
Example 7. Wnt signaling was activated, as visualized by embryo dorsalization 
(duplicated body axis) with aptamers 261 and, more strongly, 262. The results of this 
assay are shown in Figures 22 and 23. These results suggest that aptamers 261 and 
262 are able to activate Wnt signaling possibly by binding to the LBD of LRP, thereby 
preventing the modulation of LRP-mediated signaling by Dkk. 

The aptamers of the present invention can serve as HBM-mimetics. In the 
Xenopus system they are able to induce Wnt signaling all by themselves. They may 
also serve as tools for rational drug design by enhancing the understanding of how 
peptides are able to interact with LRP and modulate Wnt signaling at the specific amino 
acid level. Thus, one would be able to design small molecules to mimic their effects as 
therapeutics. In addition, the aptamers identified as positives in this assay may be used 
as therapeutic molecules themselves. 

Example 12 

Homogenous Assay 

An excellent method to investigate perturbations in protein-protein interactions is 
via Fluorescence Resonance Energy Transfer (FRET). FRET is a quantum mechanical 
process where a fluorescent molecule, the donor, transfers energy to an acceptor 
chromophore molecule which is in close proximity. This system has been successfully 
used in the literature to characterize the intermolecular interactions between LRP5 and 
Axin (Mao et al., Molec. Cell Biol. 7:801-809). There are many different fluorescent 
tags available for such studies and there are several ways to fluorescently tag the 
proteins of interest. For example, CFP (cyan fluorescent protein) and YFP (yellow 
fluorescent protein) can be used as donor and acceptor, respecively. Fusion proteins, 
with a donor and an acceptor, can be engineered, expressed, and purified. 

For instance, purified LRP protein, or portions or domains thereof, fused to CFP 

* 

and purified Dkk protein, or portions or domains thereof that interact with Dkk or LRP 
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respectively, fused to YFP can be generated and purified using standard approaches. 
If LRP-CFP and Dkk-YFP are in close proximity, the transfer of energy from CFP to 
YFP will result in a reduction of CFP emission and an increase in YFP emission. 
Energy is supplied with an excitation wavelength of 450 nm and the energy transfer is 
5 recorded at emission wavelengths of 480 nm and 570 nm. The ratio of YFP emission 
to CFP emission provides a guage for changes in the interaction between LRP and 
Dkk. This system is amenable for screening small molecule compounds that may alter 
the Dkk-LRP protein-protein interaction. Compounds that disrupt the interaction would 
be identified by a decrease in the ratio of YFP emission to CFP emission. Such 
10 compounds that modulate the LRP-Dkk interaction would then be considered candidate 
HBM mimetic molecules. Further characterization of the compounds can be done 
using the f CF-luciferase or Xenopus embryo assays to elucidate the effects of the 

* 

compounds on Wnt signaling. 

While the above example describes a cell-fee, solution-phase assay using 
15 purified components, a similar cell-based assay could also be performed. For example, 
LRP-CFP fusion protein can be expressed in cells. The Dkk-YFP fusion protein then 
could be added to the cells either as purified protein or as conditioned media. The 
..interaction of LRP and Dkk is then monitored as described above. 

20 All references cited herein are hereby incorporated by reference in their entirety 

for all purposes. The following applications are also incorporated by reference in their 
entirety herein for all purposes: U.S. Application No. 60/290,071 , filed May 1 1 , 2001 ; 
U.S. Application No. 09/544,398, filed on April 5, 2000; U.S. Application No. 
09/543,771 , filed April 5, 2000; 09/578,900; U.S. Application No. 09/229,319, filed 

25 January 13, 1999; U.S. Provisional Application 60/071 ,449, filed January 13, 1998; and 
International Application PCT/US00/1 6951 , filed June 21, 2000; International PCT 
Application entitled "HBM Variants That Modulate Bone Mass and Lipid Levels," filed 
May 13, 2002; and International PCT Application entitled "Transgenic Animal Model of 
Bone Mass Modulation," filed May 13, 2002. Additionally, this application claims priority 

30 to U.S. provisional applications 60/291 ,31 1 , filed May 1 7, 2001 ; 60/353,058, filed 
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February 1, 2002; and 60/361,293, filed March 4, 2002; the texts of which are herein 
incorporated by reference in their entirety for all purposes. 
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CLAIMS 

We claim: 

1 . A method of regulating LRP5, LRP6, or HBM activity in a subject 
comprising administering a composition which modulates a Dkk activity in an 
amount effective to regulate LRP5, LRP6, or HBM activity. 

2. The method of any of Claims 1 , 24, 28, 33, 36, 37, 48, 64, 65, 93, 
98, 1 01 , 1 05, 1 07, 1 1 1 , or 1 1 2, wherein the Dkk is Dkk-1 . 

3. The method of any of Claims 1 , 24, 28, or 33, wherein the Dkk is 
Dkk-1 and the Dkk activity is inhibited. 

4. The method of Claims 1 or 24, wherein the Dkk activity modulates 
bone mass and/or lipid levels. 

5. The method of Claim 4, wherein bone mass is increased and/or 
lipid levels are decreased. 

6. The method of Claim 5, wherein the increase in bone mass is 
determined via one or more of a decrease in fracture rate, an increase in bone 
strength, an increase in bone density, an increase in bone mineral density, an 
increase in trabecular connectivity, an increase in trabecular density, an increase 
in cortical density, an increase in bone diameter, and an increase in inorganic 
bone content. 

7. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition comprises one or more compounds selected from the group 
consisting of Dkk interacting proteins, or a Dkk-binding fragment thereof. 
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8. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition comprises an antisense, a siRNA, or shRNA molecule which 
recognizes and binds to a nucleic acid encoding one or more Dkk interacting 
proteins. 

9. The method of any of Claims 1 , 24, 28, or 33, and wherein said 
composition comprises a Dkk peptide aptamer. 

1 0. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition comprises a mimetic of a Dkk peptide aptamer. 

* 

1 1 . The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition inhibits Dkk binding to LRP5, LRP6, or HBM. 

1 2. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition enhances binding of Dkk to LRP5, LRP6, or HBM. 

1 3. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition comprises a Dkk interacting protein peptide aptamer. 

14. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition comprises a mimetic of a Dkk interacting protein peptide aptamer. 

1 5. The method of any of Claims 1 , 24, 28 or 33, wherein said 
composition inhibits Dkk interacting protein or Dkk-binding fragment thereof 
binding to Dkk. 

1 6. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition enhances binding of Dkk interacting protein or Dkk-binding 
fragment thereof to Dkk. 
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1 7. The method of any of Claims 1 , 24, 28, or 33, wherein said subject 
is a vertebrate or an invertebrate organism. 

1 8. The method of any of Claims 1 , 24, 28, or 33, wherein said subject 
5 is a mammal. 

* 

1 9. The method of any of Claims 1 , 24, 28, or 33, wherein said subject 
is a canine, a feline, an ovine, a primate, an equine, a porcine, a caprine, a 
camelid, an avian, a bovine, or a rodent. 

10 

20. The method of Claim 19, wherein said primate is a human. 

21. The method of any of Claims 1 , 24, 28, or 33, wherein said 
composition comprises an LRP5 peptide aptamer. 

15 

22. The method of Claim 21 , wherein said peptide aptamer is OST262 
(SEQ ID NO:208). 

23. The method of any of Claims 1 , 24, 28 or 33, wherein the 

20 composition comprises an LRP5 antibody or an immunologically active fragment 
thereof. 

24. A method of regulating Dkk-Wnt pathway activity in a subject 
comprising administering a composition which modulates Dkk activity in an 

25 amount effective to regulate Dkk-Wnt pathway activity. 

25. The method of Claims 24, 101 , or 107, wherein the Wnt is one or 
more of Wnt1-Wnt19. 
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26. The method of Claim 25, wherein the Wnt is Wnt1 , Wnt3, Wnt3a, 
orWntlOb. 

27. The method of Claim 24 wherein said composition which 
modulates Dkk activity or modulates Dkk interaction with LRP5/LRP6/HBM is 
administered in an amount effective to modulate Wnt signaling. 

28. A method of modulating bone mass in a subject comprising 
administering to the subject a composition which modulates Dkk activity or Dkk 
interaction with LRP5, LRP6, or HBM in an amount effective to modulate bone 
mass in the subject. 

29. The method of Claim 28, wherein bone mass is increased. 

30. The method of the previous claim, wherein the increase in bone 
mass is determined via one or more of a decrease in fracture rate, an increase 
in bone strength, an increase in bone density, an increase in bone mineral 
density, an increase in trabecular connectivity, an increase in trabecular density, 
an increase in cortical density, an increase in bone diameter, and an increase in 
inorganic bone content. 

31 . The method of Claims 28 or 36, wherein said subject has a bone 
mass disorder selected from the group consisting of a bone development 
disorder, a bone fracture, age-related loss of bone, chrondrodystrophy, drug- 
induced bone disorder, high bone turnover, hypercalcemia, hyperostosis, 
osteogenesis imperfecta, osteomalacia, osteomyelitis, osteoporosis, Paget's 
disease, osteoarthritis, and rickets. 

32. The method of Claim 28, wherein the composition which 
modulates Dkk activity or Dkk interaction with LRP5, LRP6, or HBM is 
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administered in an amount effective to modulate the amount of trabecular and/or 
cortical tissue. 

33. A method of modulating lipid levels in a subject comprising 
administering to the subject a composition which modulates Dkk activity or Dkk 
interaction with LRP5, LRP6, or HBM in an amount effective to modulate lipid 
levels in the subject. 

34. The method of Claim 33, wherein lipid levels are decreased. 

35. The method of Claim 33 or 36, wherein the subject has a lipid- 
modulated disorder and wherein the lipid-modulated disorder is selected from 
the group consisting of a cardiac condition, atherosclerosis, familial lipoprotein 
lipase deficiency, familial apoprotein Cll deficiency, familial type 3 
hyperlipoproteinemia, familial hypercholesterolemia, familial 
hypertriglyceridemia, multiple lipoprotein-type hyperlipemia, elevated lipid 
levels due to dialysis and/or diabetes, and elevated lipid levels of unknown 
etiology. 

36. A method of diagnosing low or high bone mass and/or high or low 
lipid levels in a subject comprising examining expression of Dkk, LRP5, LRP6, 
HBM, or and HBM-like variant in the subject and determining whether Dkk, 
LRP5, LRP6, HBM, or an HBM-like variant is over- or under-expressed to 
determine whether subject has (a) high or low bone mass and/or (b) has high or 
low lipid levels. 

37. A method of screening for a compound which modulates the 
interaction of Dkk with LRP5, LRP6, HBM, or a Dkk-binding fragment of LRP5, 
LRP6, or HBM comprising: 
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(a) exposing Dkk and a LRP5, LRP6, and/or HBM binding fragment 
thereof to a compound; and 

(b) determining whether said compound modulates Dkk interaction 
with the LRP5/LRP6/HBM binding fragment. 

38. The method of Claim 37, wherein said modulation is determined 
by whether said compound binds to Dkk or the LRP5, LRP6, or HBM binding 
fragment thereof. 

39. The method of Claim 37, wherein Dkk or a LRP-binding fragment 
thereof is attached to a substrate. 

40. The method of Claim 37, wherein said compound comprises one 
or more compounds selected from the group consisting of Dkk interacting 
proteins, or a Dkk-binding fragment thereof. 

41 . The method of Claim 37 or 48, wherein said compound comprises 
a Dkk peptide aptamer. 

42. The method of Claim 37 or 48, wherein said compound comprises 
a mimetic of a Dkk peptide aptamer. 

43. The method of Claim 37 or 48, wherein said compound comprises 
a Dkk interacting protein peptide aptamer. 

44. The method of Claim 37 or 48, wherein the compound comprises 
an LRP5 peptide aptamer. 

45. The method of Claim 44, wherein the peptide aptamer is OST262 
(SEQ ID NO:208). 
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46. The method of Claim 37 or 48, wherein the compound comprises 
an LRP5 antibody. 

47. The method of Claim 37 or 48, wherein said compound is a 
5 mimetic of a Dkk interacting protein peptide aptamer. 

48. A method of screening for a compound which modulates the 
interaction of Dkk with a Dkk interacting protein comprising: 

(a) exposing a Dkk interacting protein or a Dkk-binding fragment 
10 thereof to a compound; and 

(b) determining whether said compound bound to a Dkk interacting 
protein or the Dkk-binding fragment thereof; and 

(c) further determining whether said compound modulates the 
interaction of Dkk interacting protein and Dkk. 

15 

49. The method of Claim 48, wherein the Dkk interacting protein or a 
Dkk-binding fragment thereof is attached to a substrate. 

50. A composition comprising a LRP5, LRP6, or HBM activity- 
20 modulating compound and a pharmaceutical^ acceptable carrier therefor. 

51 . The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
activity-modulating compound comprises a compound which binds to Dkk 
thereby modulating the interaction of Dkk with LRP5, LRP6, or HBM. 

25 

52. The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
modulating compound comprises one or more Dkk interacting proteins and Dkk- 
binding fragments thereof. 
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53. The composition of Claim 50, wherein said LRP5, or LRP6, or 
HBM modulating compound is a monoclonal antibody or an immunologically 
active fragment thereof which binds to a Dkk interacting protein, or a Dkk-binding 
fragment thereof. 

i 

54. The composition of Claim 53, wherein the monoclonal antibody is 
human, chimeric, humanized, primatized®, or bispecific. 

55. The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
modulating compound comprises an antisense, a siRNA, or shRNA molecule 
which recognizes and binds to a nucleic acid encoding one or more Dkk 
interacting proteins. 

56. The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
modulating compound comprises a Dkk peptide aptamer. 

57. The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
modulating compound comprises a mimetic of a Dkk peptide aptamer. 

58. The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
modulating compound comprises a Dkk interacting protein peptide aptamer. 

59. The composition of Claim 50, wherein said LRP5, LRP6, or HBM 
modulating compound comprises a mimetic of a Dkk interacting protein peptide 
aptamer. 

60. The composition of Claim 50, wherein the compound comprises 
an LRP5 peptide aptamer. 
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61 . The composition of Claim 60, wherein the peptide aptamer is 
OST262. 

62. The composition of Claim 50, wherein the compound comprises 
an LRP5 antibody. 

63. A pharmaceutical composition comprising a compound which 
modulates Dkk activity and a pharmaceutical^ acceptable carrier therefor. 

64. A method for identifying compounds which modulate Dkk and 
LRP5/LRP6/HBM interactions comprising: 

(a) creating an LRP5, LRP6, or HBM fluorescent fusion protein using 
a first fluorescent tag; and 

(b) creating a Dkk fusion protein comprising a second fluorescent tag; 

(c) adding a test compound; and 

(d) assessing changes in the ratio of fluorescent tag emissions using 
Fluorescence Resonance Energy Transfer (FRET) or Bioluminescence 
Resonance Energy Transfer (BRET) to determine whether the compound 
modulates Dkk and LRP5/LRP6/HBM interactions. 

65. A method of identifying binding partners for a Dkk protein 
comprising the steps of: 

(a) exposing the Dkk protein(s) or a LRP5/LRP6 binding 
fragment thereof to a potential binding partner; and 

(b) determining if the potential binding partner binds to a Dkk 
protein or the LRP5/LRP6 binding fragment thereof. 

66. A nucleic acid encoding a Dkk interacting protein peptide aptamer 
comprising a nucleic acid encoding a scaffold protein in-frame with the activation 
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domain of Gal4 or LexA that is in-frame with a nucleic acid that encodes a Dkk 
interacting protein amino acid sequence. 

67. A vector comprising the nucleic acid of Claim 66. 

68. The nucleic acid of Claim 66, wherein the scaffold protein is trxA. 

69. A method of detecting a modulatory activity of a compound on the 
binding interaction of a first peptide and a second peptide of a peptide binding 
pair that bind through extracellular interaction in their natural environment, 
comprising: 

(i) culturing at least one eukaryotic cell comprising: 

a) a nucleotide sequence encoding a first heterologous fusion 
protein comprising the first peptide or a segment thereof 
joined to a transcriptional activation protein DNA binding 
domain; 

b) a nucleotide sequence encoding a second heterologous 
fusion protein comprising the second peptide or a segment 
thereof joined to a transcriptional activation protein 
transcriptional activation domain; 

wherein binding of the first peptide or segment thereof and the 
second peptide or segment thereqf reconstitutes a transcriptional 
activation protein; and 

c) a reporter element activated under positive transcriptional 
control of the reconstituted transcriptional activation protein, 
wherein expression of the reporter element produces a 
selected phenotype; 

(ii) incubating the eukaryotic cell in the presence of a compound 
under conditions suitable to detect the selected phenotype; and 
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(iii) detecting the ability of the compound to affect the binding 

interaction of the peptide binding pair by determining whether the 
compound affects the expression of the reporter element which 
produces the selected phenotype; 
5 wherein (1 ) said first peptide is a Dkk peptide and the second peptide is a 
peptide selected from LRP5, HBM, LRP6 and the Dkk-binding portion of 
LRP5/LRP6/HBM or (2) said first peptide is a Dkk interacting protein or the Dkk- 
binding fragment thereof and said second peptide is a Dkk peptide. 

10 70. The method of Claim 69, wherein the eukaryotic cell is a yeast 

cell. 

71 . The method of Claim 70, wherein the yeast cell is 
Saccharomyces. 

15 

72. The method of Claim 71 , wherein the Saccharomyces cell is 
Saccharomyces cerevisiae. 

73. The method of Claim 69, wherein the Dkk is Dkk-1 and wherein 
20 the compound comprises one or more Dkk interacting proteins or a Dkk-binding 

fragment thereof. 

74. The method of Claim 73, wherein the compound is directly added 
to assay. 

25 

75. The method of Claim 73, wherein the compound is recombinantly 
expressed by said eukaryotic cell in addition to said first and second peptides. 

76. The method of Claim 69, wherein the compound comprises a Dkk 
30 peptide aptamer. 
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77. The method of Claim 69, wherein the compound comprises a 
mimetic of a Dkk peptide aptamer. 

78. The method of Claim 69, wherein the compound comprises a Dkk 
interacting protein peptide aptamer. 

79. The method of Claim 69, wherein the compound comprises a 
mimetic of a Dkk interacting protein peptide aptamer. 

80. The method of Claim 69, wherein the eukaryotic cell further 
comprises at least one endogenous nucleotide sequence selected from the 
group consisting of a nucleotide sequence encoding the DNA binding domain of 
a transcriptional activation protein, a nucleotide sequence encoding the 
transcriptional activation domain of a transcriptional activation protein, and a 
nucleotide sequence encoding the reporter element, wherein at least one of the 
endogenous nucleotide sequences is inactivated by mutation or deletion. 

81 . The method of Claim 69, wherein the peptide binding pair 
comprises a ligand and a receptor to which the ligand binds. 

82. The method of Claim 69, wherein the transcriptional activation 
protein is Gal4, Gcn4, Hap1 , Adr1 , Swi5, Ste12, Mcm1, Yap1, Ace1, Ppr1, 
Arg81, Lac9, Qa1F, VP16, or a mammalian nuclear receptor. 

83. The method of Claim 69, wherein at least one of the heterologous 
fusion proteins is expressed from an autonomously-replicating plasmid. 

84. The method of Claim 69, wherein the DNA binding domain is a 
heterologous DNA-binding domain of a transcriptional activation protein. 
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85. The method of Claim 84, wherein the DNA binding protein is 
selected from the group consisting of a mammalian steroid receptor and 
bacterial LexA protein. 

86. The method of Claim 69, wherein the reporter element is selected 
from the group consisting of /acZ, a polynucleotide encoding luciferase, a 
polynucleotide encoding green fluorescent protein (GFP), and a polynucleotide 
encoding chloramphenicol acetyltransferase. 

87. The method of Claim 86, wherein the reporter element is LacZ. 

88. The method of Claim 69, wherein the test sample comprises an 
LRP5 peptide aptamer. 

89. The method of Claim 88, wherein the peptide aptamer is OST262 
(SEQ ID NO:208). 

90. The method of Claim 69, wherein the test sample comprises an 
LRP5 antibody. 

91 . A transgenic animal wherein Dkk-1 is knocked out in a tissue- 
specific fashion. 

92. The transgenic animal of Claim 91 , wherein the tissue specificity is 
bone tissue, cancer tissue, or liver tissue. 

93. A method for identifying potential compounds which modulate Dkk 
activity comprising: 

a) measuring the effect on binding of one or more Dkk interacting 
proteins, or a Dkk-binding fragment thereof, with Dkk or a 
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fragment thereof in the presence and absence of a compound; 
and 

b) identifying as a potential Dkk modulatory compound a 
compound which modulates the binding between one or more Dkk 
5 interacting proteins or Dkk-binding fragment thereof and Dkk or 

fragment thereof. 

94. A peptide aptamer of Figure 3 (SEQ ID NOs:171-188) or Figure 4 
(SEQ ID NOs:189-192). 

10 

95. An antibody or antibody fragment which recognizes and binds to 
one or more peptides of amino acid sequences GNKYQTIDNYQPYPC (SEQ ID 
NO:118 ), LDGYSRRTTLSSKMYHTKGQEG (SEQ ID NO:119), 
RIQKDHHQASNSSRLHTCQRH (SEQ ID NO:120), RGEIEETITESFGND (SEQ 

15 ID NO:121), EIFQRCYCGEGLSCRIQKD (SEQ ID NO:122), 

•MYWTDWVETPRIE (SEQ ID NO: 123), MYWTDWGETPRIE (SEQ ID NO: 124), 
KRTGGKRKEILSA (SEQ ID NO:125), ERVEKTTGDKRTRIQGR (SEQ ID 
NO:126), KQQCDSFPDCIDGSDE (SEQ ID NO:127), or a Dkk-1 amino acid 
sequence selected from the group consisting Asn34-His266 (SEQ ID NO:1 10), 

20 Asn34-Cys245 (SEQ ID NO:111), Asn34-Lys1 82 (SEQ ID NO:112), Cys97- 
His266 (SEQ ID NO:113), Val139-His266 (SEQ ID NO:114), Gly1 83-His266 
(SEQ ID NO:115), Cys97-Cys245 (SEQ ID NO:116), or Val139-Cys245 (SEQ ID 
NO:117). 

25 96. The antibody or antibody fragment of Claim 95, wherein the 

antibody is a monoclonal antibody. 

97. The antibody or antibody fragment of Claim 95, wherein the 
antibody is a polyclonal antibody 

30 
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98. A method of identifying Dkk interacting proteins which modulate 
the interaction of Dkk with the Wnt signaling pathway comprising: 

(a) injecting Dkk and potential Dkk interacting protein mRNA into a 
Xenopus blastomere; and 
5 (b) assessing axis duplication or analyzing marker gene expression; 

and 

(c) identifying compositions which elicit changes in axis duplication or 
marker gene expression as Dkk interacting proteins which modulate the 
interaction of Dkk with the Wnt signaling pathway. 

10 

99. The method of Claim 98, wherein the mRNA of HBM, LRP5/6, any 
Wnt, Wnt antagonist, Wnt pathway modulator, or combination of these is co- 
injected into the Xenopus blastomere. 

15 100. The method of Claim 98, wherein the marker gene analyzed is 

Siamois, Xnr3, slug, Xbra, HNK-1, endodermin, Xlhbox8, BMP2, BMP4, XLRP6, 
EF-1 , or ODC. 

101 . A method for identifying Dkk interacting proteins which modulate 
20 the interaction of Dkk with the Wnt signaling pathway comprising: 

(a) transfecting cells with constructs containing Dkk and potential Dkk 
interacting proteins; and 

(b) assessing changes in expression of a reporter gene linked to a 
Wnt-responsive promoter; and 

25 (c) identifying as a Dkk interacting protein any protein which alters 

reporter gene expression compared with cells transfected with a Dkk construct 
alone. 

102. The method of Claim 101 , wherein the cells are HOB-03-CE6, 
30 HEK293, or U20S cells. 
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1 03. The method of Claim 101, wherein the Wnt-responsive promoter is 
TCF or LEF. 

1 04. The method of Claim 101, wherein the cells are co-transfected 
with CMV P-galactosidase. 

105. A method for identifying compounds which modulate Dkk and 
LRP5/LRP6/HBM interactions comprising: 

(a) immobilizing LRP5/LRP6/HBM to a solid surface; and 

(b) treating the solid surface with a secreted Dkk protein or a secreted 
epitope-tagged Dkk and a test compound; and 

(c) determining whether the compound regulates binding between 
Dkk and LRP5/LRP6/HBM using antibodies to Dkk or the epitope tag or by 
directly measuring activity of an epitope tag. 

1 06. The method of Claim 1 05, wherein the epitope tag is alkaline 
phosphatase, histidine, or a V5 tag. 

107. A method for identifying compounds which modulate the 
interaction of Dkk with the Wnt signaling pathway comprising: 

(a) transfecting cells with constructs containing Dkk and Wnt proteins; 

(b) assessing changes in expression of a reporter element linked to a 
Wnt- responsive promoter; and 

(c) identifying as a Dkk/Wnt interaction modulating compound any 
compound which alters reporter gene expression compared with cells 
transfected with a Dkk construct alone. 

108. The method according to Claim 107, wherein Wnt3a and Wnt1 
constructs are co-transfected into the cells. 
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109. The method according to Claim 107, wherein the cells are U2-OS, 
HOB-03-CE6, or HEK293 cells. 

* 

1 1 0. The method according to Claim 1 07, wherein the reporter element 
5 used is TCF-luciferase, tk-Renilla, or a combination thereof. 

111- A method of testing compounds that modulate Dkk-mediated 
activity in a mammal comprising 

(a) providing a group of transgenic animals having (1) a regulatable one 
10 or more Dkk genes, (2) a knock-out of Dkk genes, or (3) a knock-in of 

one or more Dkk genes; 

(b) providing a second group of control animals respectively for the group 
of transgenic animals in step (a); and 

(c) exposing the transgenic animal group and control animal group to a 
15 potential Dkk-modulating compound which modulates bone mass or lipid 

levels; and 

(d) comparing the transgenic animals and the control group of animals 
and determining the effect of the compound on bone mass or lipid levels 
in the transgenic animals as compared to the control animals. 

20 

112. A method of screening for compounds or compositions which 
modulate the interaction of Dkk and a Dkk interacting protein comprising: 

(a) exposing a Dkk interacting proteins or a Dkk- 
binding fragment thereof to a compound; and 
25 (b) determining whether said compound binds to a 

Dkk interacting proteins or the Dkk-binding 
fragment thereof. 
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1 1 3. The method of Claim 1 1 2, wherein said modulation is determined 
by whether said compound binds to the Dkk interacting protein or the Dkk- 
binding fragment thereof. 



5 1 1 4. An antibody or antibody fragment which recognizes and binds to a 

sequence depicted in Figure 3 (SEQ ID NOs:171-188) or Figure 4 (SEQ ID 
NOs: 189-1 92). 
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Model of Wnt signaling 
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Sequence of baits used in Y2H screens 
>DKK1 (SEQIDN0:168) 

AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTG 

CGGGGCACCCAGGCTCTGCAGTCAGCGCCGCGCCGGGAATCCTGTA 

CCCGGGCGGGAATAAGTACCAGACCATTGACAACTACCAGCCGTAC 

CCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCT 

AGTCCCACCCGCGGAGGGGACGCGGGCGTGCAAATCTGTCTCGCCT 

GCAGGAAGCGCCGAAAACGCTGCATGCGTCACGCTATGTGCTGCCC 

CGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAAAT 

CATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTA 

ATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTC 

TTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTC 

CGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTG 

GTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACC 

AAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTT 

GTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCA 

TCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAA 
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>zmax1 LBD1 (SEQ ID NO: 169) 

CTCATCCTGCCCCTGCATGGACTGAGGAACGTCAAAGCCATCGACTAT 

GACCCACTGGACAAGTTCATCTACTGGGTGGATGGGCGCCAGAACATC 

AAGCGAGCCAAGGACGACGGGACCCAGCCCTTTGTTTTGACCTCTCTG 

AGCCAAGGCCAAAACCCAGACAGGCAGCCCCACGACCTCAGCATCGA 

CATCTACAGCCGGACACTGTTCTGGACGTGCGAGGCCACCAATACCAT 

CAACGTCCACAGGCTGAGCGGGGAAGCCATGGGGGTGGTGCTGCGTG 

GGGACCGCGACAAGCCCAGGGCCATCGTCGTCAACGCGGAGCGAGGG 

TACCTGTACTTCACCAACATGCAGGACCGGGCAGCCAAGATCGAACGC 

GCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCACCACCGGCCTC 

ATCCGCCCTGTGGCCCTGGTGGTAGACAACACACTGGGCAAGCTGTTC 

TGGGTGGACGCGGACCTGAAGCGCATTGAGAGCTGTGACCTGTCAGG 

GGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCAGCCTCTGGG 

CCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCGCCAGCAGCA 

GATGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGCGGACTCGCA 

TCCAGGGCCGTGTCGCCCACCTCACTGGCATCCATGCAGTGGAGGAAG 

TCAGCCTGGAGGAGTTCTCAGCCCACCCATGTGCCCGTGACAATGGTG 

GCTGCTCCCACATCTGTATTGCCAAGGGTGATGGGACACCACGGTGCT 

CATGCCCAGTCCACCTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAG 

AGCCGCCCACCTGCTCCCCGGACCAGTTTGCATGTGCCACAGGGGAGA 

TCGACTGTATCCCCGGGGCCTGGCGCTGTGACGGCTTTCCCGAGTGCG 

ATGACCAGAGCGACGAGGAGGGCTGCCCCGTGTGCTCCGCCGCCCAGT 

TCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGCGCTGCGACG 

GCGAGGCAGACTGTCAGGACCGCTCAGACGAGGCGGACTGTGACGCC 

ATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGTCCTC 

ATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCTCCGAC 

GAGCTCATGTGTGAAATCACCAAGCCGCCC 
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>zmax1 LBD4 (SEQ ID NO: 170) 

AGGGCCATCGTCGTCAACGCGGAGCGAGGGTACCTGTACTTCACCAA 

CATGCAGGACCGGGCAGCCAAGATCGAACGCGCAGCCCTGGACGGCA 

CCGAGCGCGAGGTCCTCTTCACCACCGGCCTCATCCGCCCTGTGGCCC 

TGGTGGTAGACAACACACTGGGCAAGCTGTTCTGGGTGGACGCGGAC 

CTGAAGCGCATTGAGAGCTGTGACCTGTCAGGGGCCAACCGCCTGAC 

CCTGGAGGACGCCAACATCGTGCAGCCTCTGGGCCTGACCATCCTTGG 

CAAGCATCTCTACTGGATCGACCGCCAGCAGCAGATGATCGAGCGTG 

TGGAGAAGACCACCGGGGACAAGCGGACTCGCATCCAGGGCCGTGTC 

GCCCACCTCACTGGCATCCATGCAGTGGAGGAAGTCAGCCTGGAGGA 

GTTCTCAGCCCACCCATGTGCCCGTGACAATGGTGGCTGCTCCCACAT 

CTGTATTGCCAAGGGTGATGGGACACCACGGTGCTCATGCCCAGTCCA 

CCTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGAGCCGCCCACCTG 

CTCCCCGGACCAGTTTGCATGTGCCACAGGGGAGATCGACTGTATCCC 

CGGGGCCTGGCGCTGTGACGGCTTTCCCGAGTGCGATGACGAGAGCG 

ACGAGGAGGGCTGCCCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGC 

GGGGTCAGTGTGTGGACCTGCGCCTGCGCTGCGACGGCGAGGCAGAC 

TGTCAGGACCGCTCAGACGAGGCGGACTGTGACGCCATCTGCCTGCC 

CAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGTCCTCATCAAACAGC 

AGTGCGACTCCTTCCCCGACTGTATCGACGGCTCCGACGAGCTCATGT 

GTGAAATCACCAAGCCGCCCTAAGCGGCCGC 
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Listed are the pcDNA3.1 construct names followed by the DNA sequence 
OST258 (control for OST 259-OST262 and OST264,OST265) 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTTCCAGGTTC 
ATCC 

OST259 (SEQ ID NO: 193) 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTAXGGGra 
ATCCATGAGCGATAAAATTATTCACXTGACTGACGACAGTTTTGACACGGAT^ 

TCGTCGATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGTGGTTCTGTGTTCGCGTrGTGGGCGTTTGTGGCGGTGG 
TCGTGTGGGACTAGTGGTCCGTGCAAAATGATCGCCCC<»TTCTG^ 

GACCGTTGCAAAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTTO 

TGCTGTTCAAAAACGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAGGTCAGT 

GCTAACCTGGCGTAAGCGGCCGC 

.OST260 < SE Q ID N0: i94 > 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGT^ 
ATCCATGAGCGATAAAATTATTCACCTGACTGACGAC^GTTTTGA 

TCGTCGATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGGGTGGCGGTGGTGTGGTCGGTGTGGGGCTTTGTGGTGG 

CGGCGTGTTACTAGTGGTCCGTGGRAAATGATCGCCCCGATTCTGGATGAAATCGC^ 

GACCGTTGCAAAACTGAACATCGATCAAAACCCTGGCACTGCGCCGA^ 

TGCTGTTO^W^CGGTGAAGTGGCGGCAACCA^ 

GCTAACCTGGCGTAAGCGGCCGC 

9 t 9 

■ * 

OST261 (SEQ ID NO: 195) 
^AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCT 
ATCCATGAGCGATAAJU^TTATTCACCTGACTGACGACAGTTTTGACACGGATGTAC 

TCGTCGATTTC3JJ«$lH<y^ 

TTTCTGTTGACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACT 

GACCGTTGCAAAACTGAR.CATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCATCCGTGGTAT 

TGCTGTTCAAMVACGGTGAAGTGGCGGCAACCAAAGTC 

GCTAACCTGGCGTAAGCGGCCGC 

OST262 (SEQ ID NO: 196) • 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCT^ 
ATCCATGAGCGATAAAATTATTCACCTGACTGACGACAGTTTTGACACGGATGT^ 

TCGTCGATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCGGTGGTGGGGGGATGATTTGGGAGGCTTGGAGTTGTTAT 
GCGTGTGGGACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGA^ 
. GACCGTTGCAAAACTGAACATCGATCAAAACCCTGGCACTGCGCCGAAATATGGCAT 
TGCTGTTCAAAAACGGTGAAGTGGGGGCAACCAAAGTGGGTGCACTGTCTAAAGGTCAGTTGAAAG^ 

GCTAACCTGGCGTAAGCGGCCGC 
OST263 (SEQ ID NO: 197) 

AAGCtTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
ATCCATG&GCGATAAAATTATTCACCTGACTGACGACAGTTTTGAC^ 

TCGTCGATTTCTGGGCAGAGTGGTGCGGTCCGAATTCCTTGTGGATTGGGCCGGGTGATCAGGGTCTGTTTCGGCGT 
TTTGTTTTTACTAGTGGTCCGTGCAAAATGATCGCCCCGATTCTGGATGAAATCGCTGACGAATATCAGGGCAAACT 
GACCGTTGCAAAACTGAACATCGATC^AAACCCTGGC^CTGCGCCGAAATATGGCATCC 
TGCTGTTCAAAJ^CGGTGAAGTGGCGGCAACCAAAGTGGGTGCACTGTCTAAAG^^ 
GCTAACCTGGCGTAAGCGGCCGC ' . ^ A 
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OST254 CSEQ ID NO: 198) 

j^GCTTGCCACCATGGAGACAGACACACTCCTGCTATGOTTACTGCTGCTCTGGGTTCaVGGTrCCACTGGTGACGG 
ATCCGTGTCTTCTGATCAAAATCATTTC^ 

GCACCXTGGATGGGTATTCCAGAAGAACCACCTXGTCTTCAAAAAT GTAXCACACCAAAGGACAAGAAGGTXCTGX X 

TGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTG?GCTAGACACrXCTGGTCCAAGATCTGTAflACCTGTCCT 
GAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTC 

GAGAAGGTCTGTCTTGCCGGATACMAAAGATCACCATC^ 

CACTAAGCGGCCGC 

OST265 (SEQ ID NO; 199) \ . 

AAGCTTGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTTCCAGGTTCCACTGGTGACGG 
AXCCTGCGCTAGTCCCACCCGCGGAGGGGACGCGGGCGTGCAAAXCTGXCTCGCCTG^ 

GCAXGCGXCAGGCTAXGTGCTGOCCCGGGAATXACTGCAAAAATGGAAIAXGTGXGTCXXCXGA^ 

CGAGGAGAAAITGAGGAAAC CATCAGTGAAAGCTTT GGTAAT GATCAXAGCACCXT GGAT GGGTATT QCAGAAGAAC 

CACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCT GXTXGTCTCCGGXCATCAGACTGX GCCTCAG 

GATXGXGXIGrGCTAGACACXTCTGGXCCAAGAXCTGTAAACCXGXCCTGZU^ 

AGGAGAAAAGGCT CTCAXGGAC XAGAAATATTCCAGC GT V GT TACT GTGGAGAAGGT CTGT CTTGCTAAGCGGCCGC 
OST266 (SEQ ID NO: 200) 

AAGCTTGCCACCATGGGCGATAAARTTATTCACCTGACTGACGACAGTTTTGACACGGAT 

GGCGATCCXCGTCGATTTCTGGGCAGI^GTGGTGCGGTCCGAATTCCXATGCGTGGTTGTTTTCTTGTAGTAGGTGTA 

GGTGGTGGTTGCCTXGGACTAGTGG7CCGTGCAAAATGATCGCCCCGATTCTGGATGAAOT 

GGCAAACXGACCGTTGCAAAACTGAACAXCGATCAAAACCCXGGCACTGCGCCGJ^^ 

GACTCTGCTGCTGTTCAAAAACGGTGAAGTGGCGGCAACaU^^ 

TCCTCGACGCTAACCTGGCGTAAGCGGCCGC 

* ■ * 

OST267 (SEQ ID NO: 201) 

AAGC T TGCCACCAT GGGCGAT AAAAXT ATTCACCTGACT GACGACAGXTTT GACAC GGAT GT ACTCAAAG C G GACGG 
GGCGAXCCTCGXCGAXXTCTGGGCAQVGXGGTGCGGXCCGAAX7CCATTXGXGAGGTTGXGAGGXTG7GGAGXCGGX 
ATCCTTGGTCriGGGTGACXAGXGGrCCGTGCAAAAXGAXCGCCGCGATXCTGGAXGAAATCGCXGACGAATATCAG 
GGCAAACTGACCGTTGCAAAACTGAACATCGAXCAJU\ACCCT 

GACTCTGCTGCTGTTCAAAAACGG7GAAGTGGCGGCAACCAAAGTGGGXGCACTGTCTARAGGTCAGTXQAAAGAGT 
TCCTCGACGCTAACCXGGCGTAAGCGGCCGC ' 

■ 

* m * 

OSX268 (SEQ ID NO: 202) 

AAGCTTGCCACCA7GGGCGAXAAAAT7AXTCACCTGACXGACGACAGTTTXGACACGGAXG7ACXCAAA 
GGCGAXCCTCGXCGAXTXCTGGGCAGAGXGGTGCGGTCCGAATTCCGGXXGIACTAGXGCGGTGTGTGGTGCXTGGG 
CXGAGGCGGGTAGGTTT T ATTGT ACTAGTGGT CCGTGCAAAAT GATCGCCCCGAXTCTGGATGAAAXCGCT GACGAA 
T ATCAGGGCAAACT GACCGTT GCAAftACT GAACATCGAT CAAAACCCX GGCACTGCGCCGAAATATGGCAT C CGTGG 
TATCCCGACTCTGCTGCTGTXG^AAAACGGTGAAGXGGCGGGAACCAAAGTGGGXGCACXGXCTAAAGGXGAGXXGA 

AAGAGXXCCXCGACGCTAACCTGGCGTAAGCGGCCGC 

OSX269 ( fl5rSavant°cont:rol peptide for OST266-OSX268) 
AAGCTTGCCACCATGGGCGATAAAAXTATTCACCTGACTGACGACAGTTTTGACACGGA^ 

GGCGATCCXCGTCGAXTTCXGGGCAGAGTGGXGCGGTCCGAATXCCTXGT GGAXTGGGCCGGGXGAT CAGGGT CXGX 
TTCGGCGTXXTGXTXTTACTAGTGGTCCGTGCAAAATGATCGCCCCGAXTCTGGATGAAATCGCTGACGAATATCAG 
GGCAAACXGACCGXXGCRAAACTGAACAXCGATOUUUlCCCrGGCACTGCGCCGAAAXATGGCAXCCGIGGXATCCC 
GACXCXGCTGCXGTXCAAAAACGGXGAAGTGGCGGCAACCAAAGTGGGTGCACXGTCXAAAGGTCAGTTGAAAGAGX 

TCCTCGACGCTAACCTGGCGTAAGCGGCCGC 
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lasted neiow are the amino acid sequences corresponding to the pcONA3 1 

constructs in Appendix 1A 

OST258 

MBTDTLLLWVLLLWVPGSTGDGS 

OST259 (SEQ ID NO: 204) 

METDTIXLWVLLLWVPGSTGDGSMSDCT^ 
. SGPCKMIAPILDEIMEYQGKLTVAKI^I^ 

OST260 (SEQ ID NO: 205) 

METDTLI^WVLLLWVPGSTGDGSMSDKI^ 

SGPCKMIAPILDEIADEYQGIQjTVAKIiNIDQNPGTAPKYGI^ 

OST261 (SEQ ID NO: 206) 
METDTI^XWVLLLWVPGSTGDGSMSDKXIHL^ 

SGPCKMI AP ILDE IADEYQGKLTVAKLNI DQNPGTAPKYGIRGI PTLLLFKNGEVAATKVGALS KGQLKEFLDANLA 

OST262 (SEQ ID NO: 207) 

METDTLLLWVLLLWVPGSTGDGSMSDK1IHLTDDSFD 

SGPCKMIAPILDEIADEYQGKLWAKLNIDQNPGTAPKYGIRGIPT^ 

OST263 ( SE Q * D N0: 208 > 

METDTLLLWVLIJiWVPGSTGDGSMSDKIIHLTDDSFDTDVLKADGMLVDFWAEW 
SGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPT^ 

OSr264 (SEQ ID NO: 209) 
METOTLLLWVLIJiliroPGSTGDGSVSSDQNHFRG 

SDCASGLCCARHFWSKICKFVLKEGQVCTKH^ 

» • 

OST265 (SEQ ID »G: 210) 

METDTLLLWVLLLWVPGSTGDGSCMPrRGGDAGVQICLACRKRRKRC^ 

EETITES FGNDflSTLDGYSRRTTLSSKMYHTKGQEGSVCLRS S DCAS GLCCARHFWS KI CKPVLKE GQ VCTKHRRKG 
SHGLEI 

FQRCYCGEGLSC . 

OST2SS (SEQ ID NO: 211) * 

MGDKIIlbjTDDSFDTDVLKADGAILVDFWAEWCGPNSY 
VAKLNIDQNPGTAPKTGIRGIPTLLLE^ 

OST267 (SEQ ID NO: 212) 
MGDKIIHLTDDSFDTDVIJCADGAILVDF^ 

VAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQIJCEFLDA^ 
OST268* (SEQ ID NO: 213) 

MGDKIIH1TDDSFDTDVLKADGAILVDTOAEWCGPNSGCT 

LTVAKLNIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEF1DANLA 

OST269 (SEQ ID NO: 214) 
MGDmHLTDDSFDTDVLKADGAILVDFffA^ 

VAKLNI DQNPGTAPKYGIRGI PTLL1FKNGEVAATKVGALSKGQLKE FLDMLA 
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SEQUENCE LISTING 



<110> Allen, Kristina M. 
Anisowicz, Anthony 
Bhat, Bheem 
Damagnez, Veronique 
Robinson, John 
Yaworsky, Paul 

<120> Reagents and Method for Modulating DKK-Mediated Interactions 

<130> 032796-132 

<150> US 60/291,311 
<151> 2001-05-17 

<150> US 60/353,058 
<151> 2002-02-01 

<150> US 60/361,293 
<151> 2002-03-04 

<160> 214 

<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 5120 

<212> DNA 

<213> Homo sapiens 



<400> 1 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcG 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 

15 10 



ctg 


ctg 


ctg 


ctg 


ctg 


ctg 


ctg 


gcg 


ctg 


tgc 


ggc 


tgc 


ccg 


gcc 


ccc 


gcc 


157 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Cys 


Gly 


Cys 


Pro 


Ala 


Pro 


Ala 






15 










20 










25 












gcg 


gcc 


teg 


ccg 


etc 


ctg 


eta 


ttt 


gcc 


aac 


cgc 


egg 


gac 


gta 


egg 


ctg 


205 


Ala 


Ala 


Ser 


Pro 


Leu 


Leu 


Leu 


Phe 


Ala 


Asn 


Arg 


Arg 


Asp 


Val 


Arg 


Leu 




30 










35 










40 










45 




gtg 


gac 


gcc 


ggc 


gga 


gtc 


aag 


ctg 


gag 


tec 


ace 


ate 


gtg 


gtc 


age 


ggc 


253 


Val 


Asp 


Ala 


Gly 


Gly 


Val 


Lys 


Leu 


Glu 


Ser 


Thr 


He 


Val 


Val 


Ser 


Gly 












50 










55 










60 






ctg 


gag 


gat 


gcg 


gcc 


gca 


gtg 


gac 


ttc 


cag 


ttt 


tec 


aag 


gga 


gcc 


gtg 


301 


Leu 


Glu 


Asp 


Ala 


Ala 


Ala 


Val 


Asp 


Phe 


Gin 


Phe 


Ser 


Lys 


Gly 


Ala 


Val 










65 










70 










75 








tac 


tgg 


aca 


gac 


gtg 


age 


gag 


gag 


gcc 


ate 


aag 


cag 


ace 


tac 


ctg 


aac 


349 


Tyr 


Trp 


Thr 


Asp 


Val 


Ser 


Glu 


Glu 


Ala 


He 


Lys 


Gin 


Thr 


Tyr 


Leu 


Asn 








80 










85 










90 










cag 


acg 


ggg 


gcc 


gcc 


gtg 


cag 


aac 


gtg 


gtc 


ate 


tec 


ggc 


ctg 


gtc 


tct 


397 
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vjxn inr 


Giy 


Ala 


Ala 


Val 


Gin 


Asn 


Val 


Val 


T 1 - 

lie 


Ser 


Gly Leu Val Ser 














100 










105 




r* r+ r* nap 

uUC yaw* 


ggc 


ccc 


gcc 


tgc 


gac 


tgg gtg 


*m Af* f% 

ggc 


aag 


aag 


ctg tac tgg acg 


AAC 

445 


fro Msp 


C3±y 


Leu 


Ala Cys 


Asp Trp Val 


mm 

Gly 


Lys 


Lys Leu Tyr Trp Thr 




1 1U 








115 










120 




125 




gac tea 


gag 


acc 


aac 


cgc 


ate 


gag 


gtg 


Atmmt Atmm. s-m. 

gcc 


aac 


etc 


aat ggc aca tec 


493 


Asp ber 




Tnr 


Asn Arg 


He 


Glu 


Val 


Ala 


Asn 


Leu Asn Gly Thr Ser 










130 










IOC 

135 






140 




egg aag 


gtg 


etc 


ttc 


tgg 


cag 


gac 


ctt 


gac 


cag 


ccg 


agg gcc ate gcc 


541 


Arg Lys 


Val 


Leu 


Phe 


Trp 


Gin 


Asp 


Leu 


Asp 


1 mm. 

Gin 


Pro Arg Ala He Ala 








145 










150 








155 




ctg gac 


ccc 


get 


cac 


ggg 


tac 


atg 


tac 


tgg 


aca 

% 


gac 


tgg ggt gag acg 


f\ f\ 

589 


Leu Asp 


Pro 


Ala 


His 


Gly 


Tyr 


Met 


Tyr 


Trp 


Tnr 


Asp Trp Gly Glu Thr 






loO 










165 










170 




ccc egg 


_ i_ A_ 

att 


gag 


egg 


gca 


ggg 


atg 


gat 


ggc 


age 


acc 


egg aag ate att 


*T 

637 


Pro Arg 


lie 


Glu 


Arg Ala 


Gly Met Asp Gly 


Ser 


Thr 


Arg Lys He lie 




175 










180 










185 






gtg gac 


teg 


gac 


att 


tac 


tgg 


ccc 


aat 


gga 


ctg 


acc 


ate gac ctg gag 


a*+ r\ r* 

685 


val Asp 


Ser 


Asp 


He 


Tyr 


Trp Pro Asn Gly 


Leu 


Thr 


He Asp Leu Glu 




190 








195 










200 




205 




gag cag 


aag 


etc 


tac 


tgg 


get 


gac 


gcc 


aag 


Mmi 4- A 

etc 


age 


ttc ate cac cgt 


733 


Glu Gin 


Lys 


Leu 


Tyr Trp 


Ala Asp Ala 


Lys 


Leu 


Ser 


Phe He His Arg 










210 










215 






220 




gcc aac 


ctg 


-*mmf *S J ■» 

gac 


ggc 


teg 


ttc 


egg 


cag 


aag 


gtg 


gtg 


gag ggc age ctg 


781 


Ala Asn 


Leu 


Asp 


Gly Ser 


Phe Arg Gin 


Lys 


Val 


Val 


Glu Gly Ser Leu 








o o c 

225 










230 








235 




acg cac 


ccc 


ttc 


gcc 


ctg 


acg 


etc 


tec 


ggg 


gac 


act 


ctg tac tgg aca 


t\ *\ r\ 
829 


Thr His 


Pro 
240 


Phe 


Ala 


Leu 


Thr 


Leu 
245 


Ser 


Gly 


Asp 


Thr 


Leu Tyr Trp Thr 
250 




gac tgg 


cag 


acc 


cgc 


tec 


ate 


cat 


gcc 


tgc 


aac 


aag 


cgc act ggg ggg 


877 


Asp Trp 


Gin 


Thr 


Arg 


Ser 


He 


His 


Ala 


Cys 


Asn 


Lys Arg Thr Gly Gly 




o c c 

255 










260 










265 






aag agg 


aag 


gag 


ate 


ctg 


agt 


gcc 


etc 


tac 


tea 


ccc 


atg gac ate cag 


ft rt c 

925 


Lys Arg 


Lys 


Glu 


He 


Leu 


Ser 


Ala 


Leu 


Tyr 


Ser 


Pro 


Met Asp He Gin 




OTA 

2 7Q 








275 










280 




285 




gtg ctg 


age 


cag 


gag 


egg 


cag 


cct 


ttc 


ttc 


cac 


act 


cgc tgt gag gag 


973 


val Leu 


C** <w* 

ber 


Gin 


Glu 


Arg 


Gin 


Pro 


Phe 


Phe 


HIS 


Thr Arg Cys Glu Glu 










290 










295 






300 




gac aac 


ggc 


ggc 


tgc 


tec 


cac 


ctg 


tgc 


ctg 


ctg 


tec 


cca age gag cct 


1021 


Asp Asn 


Gly 


Gly 

"3 AC 


Cys 


Ser 


His 


Leu 


Cys 
310 


Leu 


Leu 


Ser 


Pro Ser Glu Pro 
315 




tec cac 


aca 


cgc 


gcc 


tgc 


ccc 


acg 


ggt 


gtg 


cag 


ctg 


cag gac aac ggc 


T rt c Q 

1069 


Pne Tyr 


Tnr 


Lys 


Ala 


Cys 


Pro 


Thr 


Gly Val 


Gin 


Leu 


Gin Asp Asn Gly 




• 


J2U 










325 










330 




agg acg 


cgc 


aag 


gca 


gga 


gcc 


gag 


gag 


gtg 


ctg 


ctg 


ctg gcc egg egg 


1117 


Arg Thr 


Cys 


Lys 


Ala 


Gly 


Ala 


Glu 


Glu 


Val 


Leu 


Leu 


Leu Ala Arg Arg 




335 










340 










345 






acg gac 


eta 


egg 


agg 


ate 


teg 


ctg 


gac 


acg 


ccg 


gac 


ttc acc gac ate 


1165 


Thr Asp 


▼ .... 

Leu 


Arg 


Arg 


He 


Ser 


Leu Asp 


Thr 


Pro 


Asp 


Phe Thr Asp He 




rt e rt 

350 








355 










360 




365 




gtg ctg 


cag 


gtg 


gac 


gac 


ate 


egg 


cac 


gcc 


att 


gcc 


ate gac tac gac 


m rt i rt 

1213 


Val Leu 


Gin 


Val 


Asp Asp 


He Arg 


His 


Ala 


He 


Ala 


He Asp Tyr Asp 










370 










375 






380 




ccg eta 


gag 


ggc 


tat 


gtc 


tac 


tgg 


aca 


gat 


gac 


gag 


gtg egg gcc ate 


1261 


Pro Leu 


Glu 


Gly 


Tyr Val 


Tyr 


Trp 


Thr 


Asp 


Asp 


Glu 


Val Arg Ala He 








385 










390 








395 




cgc agg 


gcg 


tac 


ctg 


gac 


ggg 


tct 


ggg 


gcg 


cag 


acg 


ctg gtc aac acc 


1309 



Page 2 



WO 02/092015 



PCT/US02/15982 



032796-132. ST25 



Arg Arg 


Ala 


Tyr 


T mm * . 

Leu 


Asp Gly Ser Gly Ala 


Gin 


Thr 


Leu Val Asn 


Thr 






400 










405 










410 






gag ate 


aac 


gac 


ccc 


gat 


ggc 


ate 


gcg gtc gac tgg 


gtg gee cga 


aac 


1357 


Glu lie 


Asn Asp 


Pro 


Asp Gly 


He 


Ala 


Val Asp Trp Val Ala Arg Asn 




4 15 










420 










425 








etc tac 


tgg 


ace 


gac 


acg 


ggc 


acg 


gac 


cgc 


ate 


gag 


gtg acg cgc 


etc 


1405 


Leu Tyr 


Trp Thr 


Asp 


Thr Gly Thr Asp Arg 


He 


Glu Val Thr Arg Leu 




430 








435 










440 






445 




aac ggc 


ace 


tec 


cgc 


aag 


ate 


ctg 


gtg 


teg 


gag 


gac 


ctg gac gag 


ccc 


m* A ^ 

1453 


Asn Gly 


Thr 


Ser 


Arg 


Lys 


He 


Leu 


Val 


Ser Glu Asp 


Leu Asp Glu 


Pro 










450 










455 






460 






cga gec 


ate 


gca 


ctg 


cac 


ccc 


gtg 


atg 


ggc 


etc 


atg 


tac tgg aca 


gac 


1501 


Arg Ala 


He 


Ala 


Leu 


His 


Pro 


Val 


Met 


Gly Leu Met 


Tyr Trp Thr Asp 








465 










470 








475 






tgg gga 


gag 


aac 


cct 


aaa 


ate 


gag 


tgt 


gec 


aac 


ttg 


gat ggg cag 


gag 


1549 


Trp Gly 


Glu 


Asn 


Pro 


Lys 


He 


Glu Cys 


Ala 


Asn 


Leu 


Asp Gly Gin 


Glu 






480 










485 










490 






egg cgt 


gtg 


ctg 


gtc 


aat 


gee 


tec 


etc 


ggg 


tgg 


ccc 


aac ggc ctg 


gec 


1597 


Arg Arg 


Val 


Leu 


Val 


Asn 


Ala 


Ser 


Leu Gly Trp 


Pro 


Asn Gly Leu Ala 




4 95 










500 










505 








ctg gac 


ctg 


cag 


gag 


ggg 


aag 


etc 


tac 


tgg 


gga 


gac 


gec aag aca 


gac 


1645 


Leu Asp 


Leu 


Gin 


Glu 


Gly Lys 


Leu 


Tyr 


Trp 


Gly Asp Ala Lys Thr Asp 




510 








515 
















525 




aag ate 


gag 


gtg 


* 

ate 


aat 


gtt 


gat 


ggg 


acg 


aag 


agg 


egg acc etc 


ctg 


1693 


Lys lie 


Glu 


Val 


He 


Asn Val Asp Gly 


Tnr 


T m m, mm. 

Lys 


Arg Arg Thr Leu 


Leu 






4 




530 










■ETC 


■ 




540 






gag gac 


aag 


etc 


ccg 


cac 


att 


ttc 


ggg 


J_ mm mm* 

ttc 


acg 


ctg 


ctg ggg gac 


ttc 


1741 


Glu Asp 


Lys 


Leu 


Pro 


His 


He 


Phe Gly 


Phe 


Thr 


Leu 


Leu Gly Asp 


Phe 








545 










550 








555 






ate tac 


tgg 


act 


gac 


tgg 


cag 


cgc 


cgc 


age 


ate 


gag 


egg gtg cac 


aag 


1789 


lie Tyr 


Trp 


Thr 


Asp 


Trp Gin Arg Arg 


Ser 


He 


Glu Arg Val His 


Lys 






560 










565 










570 






gtc aag 


gee 


age 


egg 


gac 


gtc 


ate 


att 


gac 


cag 


ctg 


ccc gac ctg 


atg 


1837 


Val Lys 


Ala 


Ser 


Arg 


Asp Val 


He 


He 


Asp 


Gin 


Leu 


Pro Asp Leu Met 




575 










580 










585 








ggg etc 


aaa 


get 


gtg 


aat 


gtg 


gec 


aag 


gtc 


gtc 


gga 


acc aac ccg 


tgt 


1885 


Gly Leu 


Lys 


Ala 


Val 


Asn 


Val 


Ala 


Lys 


Val 


Val 


Gly Thr Asn Pro 


Cys 




590 








595 










600 






605 




gcg gac 


agg 


aac 


ggg 


ggg 


tgc 


age 


cac 


ctg 


tgc 


ttc 


ttc aca ccc 


cac 


i ft *5 *5 
1933 


Ala Asp 


Arg 


Asn 


Gly 


Gly Cys 


Ser 


His 


Leu 


Cys 


Phe 


Phe Thr Pro 


His 










610 










615 






620 






gca ace 


egg 


tgt 


ggc 


tgc 


ccc 


ate 


ggc 


ctg 


gag 


ctg 


ctg agt gac 


atg 


198 1 


Ala Thr 


Arg Cys 


Gly 


Cys 


Pro 


He Gly 


Leu 


Glu 


Leu 


Leu Ser Asp Met 








625 










630 








635 






aag ace 


tgc 


ate 


gtg 


cct 


gag 


gec 


ttc 


ttg 


gtc 


ttc 


acc age aga 


gee 


ft ft ft ft 

202 9 


Lys Thr 


Cys 


He 


Val 


Pro 


Glu 


Ala 


Phe 


Leu 


Val 


Phe 


Thr Ser Arg Ala 




640 










645 










650 






gee ate 


eac 


agg 


ate 


tec 


etc 


gag 


ace 


aat 


aac 


aac 


gac gtg gec 


ate 


2077 


Ala He 


His 


Arg 


He 


Ser 


Leu 


Glu 


Thr 


Asn 


Asn 


Asn 


Asp Val Ala 


He 




655 










660 










665 








ccg etc 


acg 


ggc 


gtc 


aag 


gag 


gec 


tea 


gee 


ctg 


gac 


ttt gat gtg 


tec 


2125 


Pro Leu 


Thr Gly 


Val 


Lys 


Glu 


Ala 


Ser 


Ala 


Leu 


Asp 


Phe Asp Val 


Ser 




670 








675 










680 






685 




aac aac 


cac 


ate 


tac 


tgg 


aca 


gac 


gtc 


age 


ctg 


aag 


acc ate age 


cgc 


2173 


Asn Asn 


His 


He 


Tyr 


Trp Thr Asp Val 


Ser 


Leu 


Lys 


Thr He Ser 


Arg 










690 










695 






700 






gec ttc 


atg 


aac 


ggg 


age 


teg 


gtg 


gag 


cac 


gtg 


gtg 


gag ttt ggc 


ctt 


2221 
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Ala 


Phe 


Met 


Asn 


Gly Ser 


Ser 


Val 


Glu His 


Val 


Val Glu Phe Gly Leu 










705 










710 




715 




gac 


tac 


ccc 


gag 


ggc 


atg 


gee 


gtt 


gac tgg 


atg 


ggc aag aac etc tac 


2269 


Asp 


Tyr 


Pro 


Glu 


Gly Met 


Ala 


Val Asp Trp 


Met 


Gly Lys Asn Leu Tyr 








720 










725 






730 




tgg 


gec 


gac 


act 


ggg 


acc 


aac 


aga 


ate gaa 


gtg 


gcg egg ctg gac ggg 


2317 


Trp Ala Asp 


Thr 


Gly Thr 


Asn Arg 


He Glu 


Val 


Ala Arg Leu Asp Gly 






735 










740 








745 




cag 


ttc 


egg 


caa 


gtc 


etc 


gtg 


tgg 


agg gac 


ttg 


gac aac ccg agg teg 


2365 


Gin Phe Arg 


Gin 


Val 


Leu 


Val 


Trp Arg Asp 


Leu 


Asp Asn Pro Arg Ser 




750 










755 








760 


765 




ctg 


gec 


ctg 


gat 


ccc 


acc 


aag 


ggc 


tac ate 


tac 


tgg acc gag tgg ggc 


2413 


Leu 


Ala 


Leu 


Asp 


Pro 
770 


Thr 


Lys 


Gly 


Tyr He 
775 


Tyr 


Trp Thr Glu Trp Gly 

780 




ggc 


aag 


ccg 


agg 


ate 


gtg 


egg 


gee 


ttc atg 


gac 


ggg acc aac tgc atg 


2461 


Gly 


Lys 


Pro 


Arg 


He 


Val 


Arg Ala 


Phe Met 


Asp 


Gly Thr Asn Cys Met 










785 










790 




795 




acg 


ctg 


gtg 


gac 


aag 


gtg 


ggc 


egg 


gee aac 


gac 


etc acc att gac tac 


2509 


Thr 


Leu 


Val 


Asp 


Lys 


Val 


Gly Arg Ala Asn 


Asp 


Leu Thr He Asp Tyr 








800 










805 






810 




get 


gac 


cag 


cgc 


etc 


tac 


tgg 


acc 


gac ctg 


gac 


acc aac atg ate gag 


2557 


Ala Asp Gin 


Arg 


Leu Tyr 


Trp 


Thr 


Asp Leu 


Asp 


Thr Asn Met He Glu 






815 










820 








825 




teg 


tec 


aac 


atg 


ctg 


ggt 


cag 


gag 


egg gtc 


gtg 


att gec gac gat etc 


2605 


Ser 


Ser 


Asn 


Met 


Leu Gly 


Gin 


Glu Arg Val 


Val 


He Ala Asp Asp Leu 




830 










835 








840 


845 




ccg 


cac ccg 


ttc 


ggt 


ctg 


acg 


cag 


tac age 


gat 


tat ate tac tgg aca 


2653 


Pro 


His 


Pro 


Phe 


Gly Leu 


Thr 


Gin 


Tyr Ser 


Asp 


Tyr He Tyr Trp Thr 






• 






850 








855 




860 

• 




gac 


tgg 


aat 


ctg 


cac 


age 


att 


gag 


egg gee 


gac 


aag act age ggc egg 


2701 


Asp Trp Asn 


Leu 


His 


Ser 


He Glu Arg Ala 


Asp 


• 

Lys Thr Ser Gly Arg 










865 










870 




875 




aac 


cgc 


acc 


etc 


ate 


cag 


ggc 


cac 


ctg gac 


ttc 


gtg atg gac ate ctg 


2749 


Asn Arg 


Thr 


Leu 


He 


Gin 


Gly His 


Leu Asp 


Phe 


Val Met Asp He Leu 








880 










885 






890 




gtg 


ttc 


cac 


tec 


tec 


cgc 


cag 


gat 


ggc etc 


aat 


gac tgt atg cac aac 


2797 


Val 


Phe 


His 


Ser 


Ser 


Arg 


Gin Asp Gly Leu 


Asn 


Asp Cys Met His Asn 






895 










900 








905 




aac 


ggg 


cag 


tgt 


ggg 


cag 


ctg 


tgc 


ctt gec 


ate 


ccc ggc ggc cac cgc 


2845 


Asn 


Gly Gin 


Cys 


Gly Gin 


Leu Cys 


Leu Ala 


He 


Pro Gly Gly His Arg 




910 










915 








920 


925 




tgc 


ggc 


tgc 


gec 


tea 


cac 


tac 


acc 


ctg gac 


ccc 


age age cgc aac tgc 


*S #X 

2893 


Cys 


Gly Cys 


Ala 


Ser 


His 


Tyr 


Thr 


Leu Asp 


Pro 


Ser Ser Arg Asn Cys 












930 








935 




940 




age 


ccg 


ccc 


acc 


acc 


ttc 


ttg 


ctg 


ttc age 


cag 


aaa tct gee ate agt 


2941 


Ser 


Pro 


Pro 


Thr 
945 


Thr 


Phe 


Leu 


Leu 


Phe Ser 
950 


Gin 


Lys Ser Ala He Ser 
955 


» 


c gg 


atg 


ate 


ccg 


gac 


gac 


cag 


cac 


age ccg 


gat 


etc ate ctg ccc ctg 


2989 


Arg 


Met 


He 
960 


Pro 


Asp 


Asp 


Gin 


His 
965 


Ser Pro 


Asp 


Leu He Leu Pro Leu 
970 




cat 


gga 


ctg 


agg 


aac 


gtc 


aaa 


gee 


ate gac 


tat 


gac cca ctg gac aag 


3037 


His 


Gly Leu 


Arg 


Asn 


Val 


Lys 


Ala 


He Asp 


Tyr 


Asp Pro Leu Asp Lys 






975 










980 








985 




ttc 


ate 


tac 


tgg 


gtg 


gat 


ggg 


cgc 


cag- aac 


ate 


aag cga gec aag gac 


3085 


Phe 


He 


Tyr 


Trp 


Val 


Asp 


Gly Arg Gin Asn 


He 


Lys Arg Ala Lys Asp 




990 










995 








1000 1005 




gac 


ggg 


acc 


cag 


ccc 


ttt 


gtt 


ttg 


acc tct 


ctg 


age caa ggc caa aac 


3133 
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Asp Gly Thr Gin 


Pro 


Phe 


Val 


Leu 


Thr 


Ser 


Leu 


Ser 


Gin Gly Gin 


Asn 










1010 








1015 








1020 




cca gac 


agg 


cag 


ccc 


cac 


gac 


etc 


age 


ate 


gac 


ate 


tac 


age 


egg 


aca 


3181 


Pro Asp Arg 


Gin 


Pro 


His 


Asp 


Leu 


Ser 


He 


Asp 


He 


Tyr 


Ser Arg 


Thr 








1025 








1030 








1035 




• 


ctg ttc 


tgg acg tgc 


gag 


gee 


acc 


aat 


acc 


ate 


aac 


gtc 


cac 


agg 


ctg 


3229 


Leu Phe 


Trp Thr Cys 


Glu 


Ala 


Thr 


Asn 


Thr 


He 


Asn 


Val 


His 


Arg 


Leu 






1040 








1045 








1050 






age ggg 


gaa 


gec 


atg 


ggg 


gtg 


gtg 


ctg 


cgt 


ggg 


gac 


cgc 


gac 


aag 


ccc 


3277 


Ser Gly Glu 


Ala 


Met 


Gly Val 


Val 


Leu 


Arg 


Gly 


Asp Arg 


Asp 


Lys 


Pro 




1055 








1060 








1065 










agg gec 


ate 


gtc 


gtc 


aac 


gcg 


gag 


cga 


ggg 


tac 


ctg 


tac 


ttc 


acc 


aac 


3325 


Arg Ala 


He 


Val- 


Val 


Asn 


Ala 


Glu 


Arg 


Gly 


Tyr 


Leu 


Tyr 


Phe 


Thr 


Asn 




1070 








1075 








1080 








1085 




atg cag 


gac 


egg 


gca 


gee 


aag 


ate 


gaa 


cgc 


gca 


gec 


ctg 


gac 


ggc 


acc 


3373 


Met Gin 


Asp 


Arg 


Ala 


Ala 


Lys 


He 


Glu 


Arg 


Ala 


Ala 


Leu 


Asp 


Gly 


Thr 










1090 


• 






1095 








1100 




gag cgc 


gag 


gtc 


etc 


ttc 


acc 


acc 


ggc 


etc 


ate 


cgc 


cct 


gtg 


gee 


ctg 


3421 


Glu Arg 


Glu 


Val 


Leu 


Phe 


Thr 


Thr 


Gly 


Leu 


He 


Arg 


Pro 


Val 


Ala 


Leu 








1105 








1110 








1115 






gtg gtg 


gac 


aac 


aca 


ctg 


ggc 


aag 


ctg 


ttc 


tgg 


gtg 


gac 


gcg 


gac 


ctg 


3469 


Val Val 


Asp 


Asn 


Thr 


Leu 


Gly 


Lys 


Leu 


Phe 


Trp 


Val 


Asp 


Ala 


Asp 


Leu 






1120 








1125 








1130 








aag cgc 


att 


gag 


age 


tgt 


gac 


ctg 


tea 


ggg 


gec 


aac 


cgc 


ctg 


acc 


ctg 


3517 


Lys Arg 


He 


Glu 


Ser 


Cys 


Asp 


Leu 


Ser 


Gly 


Ala 


Asn 


Arg 


Leu 


Thr 


Leu 




1135. 








1140 








1145 










gag gac 


gee 


aac 


ate 


gtg 


cag 


cct 


ctg 


ggc 


ctg 


acc 


ate 


ctt 


ggc 


aag 


3565 


Glu Asp 


Ala 


Asn 


He 


Val 


Gin 


Pro 


Leu 


Gly 


Leu 


Thr 


He 


Leu 


Gly 


Lys 




1150 








1155 








1160 








1165 




cat etc 


tac 


tgg 


ate 


gac 


cgc 


cag 


cag 


cag 


atg 


ate 


gag 


cgt 


gtg 


gag 


3613 


His Leu 


Tyr 


Trp 


He 


Asp 


Arg 


Gin 


Gin 


Gin 


Met 


lie 


Glu 


Arg 


Val 


Glu 










1170 








1175 








1180 




aag acc 


acc 


ggg 


gac 


aag 


egg 


act 


cgc 


ate 


cag 


ggc 


cgt 


gtc 


gee 


cac 


3661 


Lys Thr 


Thr 


Gly 


Asp 


Lys 


Arg 


Thr 


Arg 


He 


Gin 


Gly 


Arg 


Val 


Ala 


His 








1185 








1190 








1195 






etc act 


ggc 


ate 


cat 


gca 


gtg 


gag 


gaa 


gtc 


age 


ctg 


gag 


gag 


ttc 


tea 


3709 


Leu Thr 


Gly 


He 


His 


Ala 


Val 


Glu 


Glu 


Val 


Ser 


Leu 


Glu 


Glu 


Phe 


Ser 






1200 








1205 

* 








1210 








gee cac 


cca 


tgt 


gee 


cgt 


gac 


aat 


ggt 


ggc 


tgc 


tec 


cac 


ate 


tgt 


att 


3757 


Ala His 


Pro 


Cys 


Ala 


Arg 


Asp 


Asn 


Gly 


Gly 


Cys 


Ser 


His 


He Cys 


He 




1215 








1220 








1225 










gec aag 


ggt 


gat 


ggg 


aca 


cca 


egg 


tgc 


tea 


tgc 


cca 


gtc 


cac 


etc 


gtg 


3805 


Ala Lys 


Gly 


Asp 


Gly 


Thr 


Pro 


Arg 


Cys 


Ser 


Cys 


Pro 


Val 


His 


Leu 


Val 




1230 








1235 








1240 








1245 




etc ctg 


cag 


aac 


ctg 


ctg 


acc 


tgt 


gga 


gag 


ccg 


ccc 


acc 


tgc 


tec 


ccg 


3853 


Leu Leu 


Gin 


Asn 


Leu 


Leu 


Thr 


Cys 


Gly 


Glu 


Pro 


Pro 


Thr 


Cys 


Ser 


Pro 










1250 








1255 








1260 


V 


gac cag 


ttt 


gca 


tgt 


gec 


aca 


ggg 


gag 


ate 


gac 


tgt 


ate 


ccc 


ggg 


gec 


3901 


Asp Gin 


Phe 


Ala 


Cys 


Ala 


Thr 


Gly 


Glu 


He 


Asp 


Cys 


He 


Pro 


Gly Ala 








1265 








1270 








1275 






tgg cgc 


tgt 


gac 


ggc 


ttt 


ccc 


gag 


tgc 


gat 


gac 


cag 


age 


gac 


gag 


gag 


3949 


Trp Arg 


Cys 


Asp Gly 


Phe 


Pro 


Glu 


Cys 


Asp Asp 


Gin 


Ser 


Asp Glu Glu 





1280 1285 1290 



ggc tgc ccc gtg tgc tec gee gec cag ttc ccc tgc gcg egg ggt cag 3997 

Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 

1295 1300 1305 

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045 
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Cys 


Val 


Asp Leu Arg Leu Arg 


Cys Asp Gly Glu 


Ala 


Asp Cys 


Gin 


Asp 




1310 








1315 








1320 








1325 




cgc 


tea 


gac 


gag 


gtg 


gac 


tgt 


gac 


gec 


ate 


tgc 


ctg 


ccc 


aac 


cag 


ttc 


4093 


Arg 


Ser 


Asp Glu Val Asp 


Cys 


Asp Ala 


He 


Cys 


Leu 


Pro 


Asn 


Gin 


Phe 












1330 








1335 








1340 




egg 


tgt 


gcg 


age 


ggc 


cag 


tgt 


gtc 


etc 


ate 


aaa 


cag 


cag tgc 


gac 


tec 


4141 


Arg 


Cys 


Ala 


Ser Gly 


Gin 


Cys 


Val 


Leu 


He 


Lys 


Gin 


Gin Cys Asp 


Ser 










1345 








1350 








1355 






ttc 


ccc 


gac 


tgt 


ate 


gac 


ggc 


tec 


gac 


gag 


etc 


atg 


tgt 


gaa 


ate 


ace 


4189 


Phe 


Pro 


Asp Cys 


He 


Asp 


Gly 


Ser Asp 


Glu 


Leu 


Met 


Cys 


Glu 


He 


Thr 








1360 








1365 








1370 








aag 


ccg 


ccc 


tea 


gac 


gac 


age 


ccg 


gee 


cac 


age 


agt 


gec 


ate 


ggg 


ccc 


4237 


Lys 


Pro 


Pro 


Ser 


Asp 


Asp 


Ser 


Pro 


Ala 


His 


Ser 


Ser 


Ala 


He 


Gly 


Pro 






1375 








1380 








1385 










gtc 


att 


ggc 


ate 


ate 


etc 


tct 


etc 


ttc 


gtc 


atg 


ggt 


ggt 


gtc 


tat 


ttt 


4285 


Val 


He 


Gly 


He 


He 


Leu 


Ser 


Leu 


Phe 


Val 


Met 


Gly 


Gly 


Val 


Tyr 


Phe 




1390 








1395 








1400 








1405 




gtg 


tgc 


cag 


cgc 


gtg 


gtg 


tgc 


cag 


cgc 


tat 


gcg 


ggg 


gee 


aac 


ggg 


ccc 


4333 


Val 


Cys 


Gin 


Arg 


Val 


Val 


Cys 


Gin 


Arg 


Tyr 


Ala 


Gly 


Ala 


Asn 


Gly 


Pro 












1410 








1415 








1420 




ttc 


ccg 


cac 


gag 


tat 


gtc 


age 


ggg 


ace 


ccg 


cac 


gtg 


ccc 


etc 


aat 


ttc 


4381 


Phe 


Pro 


His 


Glu 


Tyr 


Val 


Ser 


Gly 


Thr 


Pro 


His 


Val 


Pro 


Leu 


Asn 


Phe 










1425 








1430 








1435 






ata 


gec 


ccg 


ggc 


ggt 


tec 


cag 


cat 


ggc 


ccc 


ttc 


aca 


ggc 


ate 


gca 


tgc 


4429 


He 


Ala 


Pro 


Gly 


Gly 


Ser 


Gin 


His 


Gly 


Pro 


Phe 


Thr 


Gly 


He 


Ala 


Cys 








1440 








1445 








1450 








gga 


aag 


tec 


atg 


atg 


age 


tec 


gtg 


age 


ctg 


atg 


ggg 


ggc 


egg 


ggc 


ggg 


4477 


Gly Lys 


Ser 


Met 


Met 


Ser 


Ser 


Val 


Ser 


Leu 


Met 


Gly 


Gly 


Arg 


Gly 


Gly 





1455 1460 1465 



gtg ccc etc tac gac egg aac cac gtc aca ggg gee teg tec age age 4525 

Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 

1470 1475 1480 1485 

teg tec age acg aag gee acg ctg tac ccg ccg ate ctg aac ccg ccg 4 57 3 

Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro lie Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gee acg gac ccc tec ctg tac aac atg gac atg ttc tac 4621 
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gee act gcg aga ccg tac agg ccc tac ate att 4 669 

Ser Ser Asn He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He 

1520 1525 1530 

cga gga atg gcg ccc ccg acg acg ccc tgc age acc gac gtg tgt gac 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gac tac age gec age cgc tgg aag gee age aag tac tac ctg gat 47 65 

Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gee acc gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 4 909 

Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4 965 
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Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 
tggagaaata tttataaact taattttgta aaaca 



<210> 2 

<211> 5120 

<212> DNA 

<213> Homo sapiens 



<400> 2 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 

Met. Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 

15 10 



ctg 


ctg 


ctg 


ctg 


ctg 


ctg 


ctg 


gcg 


ctg 


tgc 


ggc 


tgc 


ccg 


gec 


ccc 


gee 


157 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Cys 


Gly 


Cys 


Pro 


Ala 


Pro 


Ala 






15 










20 










25 












gcg 


gec 


teg 


ccg 


etc 


ctg 


eta 


ttt 


gec 


aac 


cgc 


egg 


gac 


gta 


egg 


ctg 


205 


Ala 


Ala 


Ser 


Pro 


Leu 


Leu 


Leu 


Phe 


Ala 


Asn 


Arg 


Arg 


Asp 


Val 


Arg 


Leu 




30 










35 










40 










45 




gtg 


gac 


gee 


ggc 


gga 


gtc 


aag 


ctg 


gag 


tec 


ace 


ate 


gtg 


gtc 


age 


ggc 


253 


Val 


Asp 


Ala 


Gly Gly 


Val 


Lys 


Leu 


Glu 


Ser 


Thr 


He 


Val 


Val 


Ser 


Gly 












50 










55 










60 






ctg 


gag 


gat 


gcg 


gec 


gca 


gtg 


gac 


ttc 


cag 


ttt 


tec 


aag 


gga 


gec 


gtg 


301 


Leu 


Glu 


Asp 


Ala 


Ala 


Ala 


Val 


Asp 


Phe 


Gin 


Phe 


Ser 


Lys 


Gly 


Ala 


Val 










65 










70 










75 








tac 


tgg 


aca 


gac 


gtg 


age 


gag 


gag 


gee 


ate 


aag 


cag 


ace 


tac 


ctg 


aac 


349 


Tyr 


Trp 


Thr 


Asp Val 


Ser 


Glu 


Glu 


Ala 


He 


Lys 


Gin 


Thr 


Tyr 


Leu 


Asn 








80 










85 










90 










cag 


acg 


ggg 


gec 


gec 


gtg 


cag 


aac 


gtg 


gtc 


ate 


tec 


ggc 


ctg 


gtc 


tct 


397 


Glh 


Thr 


Gly 


Ala 


Ala 


Val 


Gin 


Asn 


Val 


Val 


He 


Ser 


Gly 


Leu 


Val 


Ser 






95 










100 










105 












ccc 


gac 


ggc 


etc 


gee 


tgc 


gac 


tgg 


gtg 


ggc 


aag 


aag 


ctg 


tac 


tgg 


acg 


445 


Pro 


Asp 


Gly 


Leu 


Ala 


Cys 


Asp 


Trp 


Val 


Gly 


Lys 


Lys 


Leu 


Tyr 


Trp 


Thr 




110 










115 










120 










125 




gac 


tea 


gag 


ace 


aac 


cgc 


ate 


gag 


gtg 


gee 


aac 


etc 


aat 


ggc 


aca 


tec 


493 


Asp 


Ser 


Glu 


Thr 


Asn 


Arg 


He 


Glu 


Val 


Ala 


Asn 


Leu 


Asn 


Gly 


Thr 


Ser 












130 










135 










140 






egg 


aag 


gtg 


etc 


ttc 


tgg 


cag 


gac 


ctt 


gac 


cag 


ccg 


agg 


gee 


ate 


gee 


541 


Arg 


Lys 


Val 


Leu 


Phe 


Trp 


Gin 


Asp 


Leu 


Asp 


Gin 


Pro 


Arg 


Ala 


He 


Ala 










145 










150 










155 








ttg 


gac 


ccc 


get 


cac 


ggg 


tac 


atg 


tac 


tgg 


aca 


gac 


tgg 


gtt 


gag 


acg 


589 


Leu 


Asp 


Pro 


Ala 


His 


Gly 


Tyr 


Met 


Tyr 


Trp 


Thr 


Asp 


Trp 


Val 


Glu 


Thr 








160 










165 










170 










ccc 


egg 


att 


gag 


egg 


gca 


ggg 


atg 


gat 


ggc 


age 


ace 


egg 


aag 


ate 


att 


637 


Pro 

• 


Arg 


He 


Glu Arg 


Ala 


Gly 


Met 


Asp 


Gly 


Ser 


Thr 


Arg 


Lys 


He 


He 






175 










180 










185 












gtg 


gac 


teg 


gac 


att 


tac 


tgg 


ccc 


aat 


gga 


ctg 


ace 


ate 


gac 


ctg 


gag 


685 


Val 


Asp 


Ser 


Asp 


He 


Tyr 


Trp 


Pro 


Asn 


Gly 


Leu 


Thr 


lie 


Asp 


Leu 


Glu 




190 










195 










200 










205 




gag 


cag 


aag 


etc 


tac 


tgg 


get 


gac 


gee 


aag 


etc 


age 


ttc 


ate 


cac 


cgt 


733 


Glu 


Gin 


Lys 


Leu Tyr 


Trp 


Ala 


Asp 


Ala 


Lys 


Leu 


Ser 


Phe 


He 


His 


Arg 





5025 
5085 
5120 
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210 



gcc aac 


ctg 


gac 


ggc 


teg 


ttc 


egg 


cag 


Ala Asn 


Leu 


Asp 


Gly 


Ser 


Phe Arg Gin 






225 










230 


acg cac 


ccc 


ttc 


gcc 


ctg 


aca 


etc 


tec 


Thr His 


Pro 


Phe 


Ala 


Leu 


Thr 


Leu 


Ser 




240 










245 




gac tgg 


cag 


acc 


cgc 


tec 


ate 


cat 


gcc 


Asp Trp 


Gin 


Thr 


Arg 


Ser 


He 


His 


Ala 


255 










260 






aag agg 


aag 


gag 


ate 


ctg 


agt 


gcc 


etc 


Lys Arg 


Lys 


Glu 


He 


Leu 


Ser 


Ala 


Leu 


270 








275 








gtg ctg 


age 


cag 


gag 


egg 


cag 


cct 


ttc 


Val Leu 


Ser 


Gin 


Glu 


Arg 


Gin 


Pro 


Phe 








290 










gac aat 


ggc 


ggc 


tgc 


tec 


cac 


ctg 


tgc 


Asp Asn 


Gly 


Gly 


Cys 


Ser 


His 


Leu Cys 






305 










310 


ttc tac 


aca 


tgc 


gcc 


tgc 


ccc 


acg 


ggt 


Phe Tyr 


Thr 


Cys 


Ala 


Cys 


Pro Thr Gly 




320 










325 




agg acg 


tgt 


aag 


gca 


gga 


gcc 


gag 


gag 


Arg Thr 


Cys 


Lys 


Ala 


Gly Ala Glu Glu 


335 










340 






acg gac 


eta 


egg 


agg 


ate 


teg 


ctg 


gac 


Thr Asp 


Leu 


Arg 


Arg 


He 


Ser 


Leu Asp 


350 








355 








gtg ctg 


cag 


gtg 


gac 


gac 


ate 


egg 


cac 


Val Leu 


Gin 


Val 


Asp 


Asp 


He Arg His 








370 










ccg eta 


gag 


ggc 


tat 


gtc 


tac 


tgg 


aca 


Pro Leu 


Glu 


Gly 


Tyr 


Val 


Tyr 


Trp 


Thr 






385 










390 


cgc agg 


gcg 


tac 


ctg 


gac 


ggg 


tct 


ggg 


Arg Arg 


Ala 


Tyr 


Leu 


Asp 


Gly Ser 


Gly 




400 










405 




gag ate 


aac 


gac 


ccc 


gat 


ggc 


ate 


gcg 


Glu He 


Asn 


Asp 


Pro 


Asp Gly 


He 


Ala 


415 










420 






etc tac 


tgg 


acc 


gac 


acg 


ggc 


acg 


gac 


Leu Tyr 


Trp 


Thr 


Asp 


Thr Gly Thr Asp 


4 30 








435 








aac ggc 


acc 


tec 


cgc 


aag 


ate 


ctg 


gtg 


Asn Gly 


Thr 


Ser 


Arg 


Lys 


He 


Leu 


Val 








450 










cga gcc 


ate 


gca 


ctg 


cac 


ccc 


gtg 


atg 


Arg Ala 


He 


Ala 


Leu 


His 


Pro 


Val 


Met 






465 










470 


tgg gga 


gag 


aac 


cct 


aaa 


ate 


gag 


tgt 


Trp Gly 


Glu 


Asn 


Pro 


Lys 


He 


Glu 


Cys 




480 










485 




egg cgt 


gtg 


ctg 


gtc 


aat 


gcc 


tec 


etc 


Arg Arg 


Val 


Leu 


Val 


Asn 


Ala 


Ser 


Leu 


4 95 










500 






ctg gac 


ctg 


cag 


gag 


ggg 


aag 


etc 


tac 



215 






220 






aag 


gtg 


gtg 


gag ggc age 


ctg 


781 


Lys 


Val 


Val 


Glu Gly Ser 


Leu 










235 






ggg 


gac 


act 


ctg tac tgg 


aca 


829 


Gly 


Asp 


Thr 


Leu Tyr Trp 


Thr 










250 






tgc 


aac 


aag 


cgc act ggg 


ggg 


877 


Cys 


Asn 


Lys 


Arg Thr Gly 


Gly 








265 




» 




tac 


tea 


ccc 


atg gac ate 


cag 


925 


Tyr 


Ser 


Pro 


Met Asp He 


Gin 






280 






285 




ttc 


cac 


act 


cgc tgt gag 


gag 


973 


Phe 


His 


Thr 


Arg Cys Glu 


Glu 




295 






300 






ctg 


ctg 


tec 


cca age gag 


cct 


1021 


Leu 


Leu 


Ser 


Pro Ser Glu 


Pro 










315 






gtg 


cag 


ctg 


cag gac aac 


ggc 


1069 


Val 


Gin 


Leu 


Gin Asp Asn 


Gly 










330 






gtg 


ctg 


ctg 


ctg gcc egg 


egg 


1117 


Val 


Leu 


Leu 


Leu Ala Arg 


Arg 








345 








acg 


ccg 


gac 


ttc acc gac 


ate 


1165 


Thr 


Pro 


Asp 


Phe Thr Asp 


He 






360 






365 




gcc 


att 


gcc 


ate gac tac 


gac 


1213 


Ala 


He 


Ala 


He Asp Tyr 


Asp 




375 






380 






gat 


gac 


gag 


gtg egg gcc 


ate 


1261 


Asp 


Asp 


Glu 


Val Arg Ala 


He 










395 






gcg 


cag 


acg 


ctg gtc aac 


acc 


1309 


Ala 


Gin 


Thr 


Leu Val Asn 


Thr 










410 






gtc 


gac 


tgg 


gtg gcc cga 


aac 


1357 


Val 


• 

Asp 


Trp 


Val Ala Arg 


Asn 








425 








cgc 


ate 


gag 


gtg acg cgc 


etc 


1405 


Arg 


He 


Glu 


Val Thr Arg 


Leu 






440 






445 




teg 


gag 


gac 


ctg gac gag 


ccc 


1453 


Ser 


Glu 


Asp 


Leu Asp Glu 


Pro 




455 






460 




■ 


ggc 


etc 


atg 


tac tgg aca 


gac 


1501 


Gly 


Leu 


Met 


Tyr Trp Thr 


Asp 










475 






gcc 


aac 


ttg 


gat ggg cag 


gag 


1549 


Ala 


Asn 


Leu 


Asp Gly Gin 


Glu 










490 






ggg 


tgg 


ccc 


aac ggc ctg 


gcc 


1597 


Gly 


Trp 


Pro 


Asn Gly Leu 


Ala 








505 








tgg 


gga 


gac 


gcc aag aca 


gac 


1645 
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Leu Asp 


mm 

Leu 


Gin 


Glu 


Gly Lys 


Leu 


Tyr 


Trp 


Gly Asp Ala Lys Thr Asp 




510 










515 










520 






525 




aag 


ate 


gag 


gtg 


ate 


aat 


gtt 


gat 


ggg 


acg 


aag 


agg egg acc 


etc 


ctg 


1693 


Lys 


He 


Glu 


Val 


He 


Asn Val Asp 


Gly 


mmm. 1 

Thr 


Lys Arg Arg Thr Leu Leu 












530 










mm* mm, mm 

535 






540 






gag 


gac 


mmm 

aag 


etc 


ccg 


cac 


att 


ttc 


ggg 


ttc 


acg 


ctg ctg ggg 


gac 


ttc 


1741 


Glu Asp 


Lys 


Leu 


Pro 


His 


He 


Phe Gly 


Phe 


Thr 


Leu Leu Gly Asp 


Phe 










545 










550 






555 








ate 


tac 


tgg 


mm km. 

act 


gac 


tgg 


cag 


cgc 


cgc 


age 


ate 


gag egg gtg 


cac 


aag 


1789 


He 


Tyr 


mmm 

Trp 


Thr 


m— 

Asp 


Trp Gin Arg Arg 


Ser 


He Glu Arg Val 


His 


Lys 








560 










mm -p* 

565 








570 








gtc 


aag 


gee 


age 


egg 


gac 


gtc 


ate 


att 


gac 


cag 


ctg ccc gac 


ctg 


atg 


1837 


Val 


Lys 
575 


Ala 


Ser 


Arg 


Asp 


Val 
580 


He 


He 


Asp 


Gin 


Leu Pro Asp 
585 


Leu 


Met 


■ 


ggg 


etc 


aaa 


get 


gtg 


aat 


gtg 


gec 


aag 


gtc 


gtc 


gga acc aac 


ccg 


tgt 


1885 


Gly Leu 


Lys 


Ala 


Val 


Asn 


Val 


Ala 


Lys 


Val 


Val 


Gly Thr Asn 


Pro Cys 




590 










595 










600 






605 




gcg 


gac 


agg 


aac 


ggg 


ggg 


tgc 


age 


cac 


ctg 


tgc 


ttc ttc aca 


ccc 


cac 


1933 


Ala Asp 


Arg 


Asn 


Gly 


Gly Cys 


Ser 


mm • 

His 


Leu 


Cys 


Phe Phe Thr 


Pro 


His 












610 










^m m\ mm 

615 






620 






gca 


acc 


egg 


tgt 


ggc 


tgc 


ccc 


ate 


ggc 


ctg 


gag 


ctg ctg agt 


gac 


atg 


1981 


Ala 


Thr 


Arg 


Cys 
625 


Gly 


Cys 


Pro 


He 


m. 

Gly 

^m mm. m\ 

630 


Leu 


Glu 


Leu Leu Ser 
635 


Asp 


Met 




aag 


acc 


* 

tgc 


ate 


gtg 


cct 


gag 


gec 


ttc 


ttg 


gtc 


ttc acc age 


aga 


gec 


2029 


Lys 


Thr 


Cys 


He 


Val 


Pro 


Glu 


Ala 


Phe 


Leu 


Val 


Phe Thr Ser Arg Ala 








640 










645 








650 








gec 


ate 


cac 


agg 


ate 


tec 


etc 


gag 


acc 


aat 


aac 


aac gac gtg 


gec 


ate ■ 


0\ mm M mw 9 

2077 


Ala 


He 


His 


Arg 


lie 


Ser 


Leu 


Glu 


mmm % 

Thr 


Asn 


Asn Asn Asp Val 


Ala 


He 






655 










660 










665 








ccg 


etc 


acg 


ggc 


gtc 


aag 


gag 


gee 


tea 


gee 


ctg 


gac ttt gat 


gtg 


tec 


mm* «■ m*A mm 

2125 


Pro 


Leu 


Thr 


Gly 


Val 


Lys 


Glu 


Ala 


Ser 


Ala 


Leu Asp Phe Asp 


Val 


Ser 




670 










675 










680 






685 




aac 


aac 


cac 


ate 


tac 


tgg 


aca 


gac 


gtc 


age 


ctg 


aag acc ate 


age 


cgc 


2173 


Asn 


Asn 


Hxs 

* 


mmm mt 

lie 


Tyr 
690 


Trp 


Thr 


Asp 


Val 


Ser 

^■a> mm. mm 

695 


Leu 


Lys Thr He 


Ser 
700 


Arg 




gec 


ttc 


• 

atg 


aac 


ggg 


age 


teg gtg 


gag 


cac 


gtg 


gtg gag ttt 


ggc 


ctt 


m\ mm. mm. m\ 

2221 


Ala 


Phe 


h A A— 

Met 


Asn 


Gly 


Ser 


Ser 


Val 


Glu 


His 


Val 


Val Glu Phe Gly Leu 










705 










710 






715 








gac 


tac 


ccc 


gag 


ggc 


atg 


gee 


gtt 


gac 


Aw* — - - 

tgg 


atg 


ggc aag aac 


etc 


tac 


^\ ^ f\ 
2269 


Asp 


Tyr 


Pro 


Glu 


Gly 


Met 


Ala 


Val 


Asp 


Trp 


Met Gly Lys Asn Leu 


Tyr 








720 










725 








730 








tgg 


gee 


gac 


act 


ggg 


acc 


aac 


aga 


ate 


gaa 


gtg 


gcg egg ctg 


gac 


ggg 


2317 


Trp Ala 


Asp 


Thr 


Gly 


Thr 


Asn Arg 


He 


Glu 


Val Ala Arg Leu Asp Gly 






735 










740 










745 








cag 


ttc 


egg 


caa 


gtc 


etc 


gtg 


tgg 


agg 


gac 


ttg 


gac aac ccg 


agg 


teg 


/-\ a -m mm 

2365 


Gin 


Phe 


Arg 


Gin 


Val 


Leu 


Val 


Trp Arg 


Asp 


Leu Asp Asn Pro Arg 


Ser 




750 










755 










760 






765 




ctg 


gec 


ctg 


gat 


ccc 


acc 


aag 


ggc 


tac 


ate 


tac 


tgg acc gag 


tgg 


ggc 


mm* m •** 

2413 


Leu 


Ala 


mm 

Leu 


mm 

Asp 


Pro 


Thr Lys Gly Tyr 


He 


Tyr Trp Thr Glu Trp Gly 












mmm mmm m\ 

770 










775 






780 






ggc 


aag 


ccg 


agg 


ate 


gtg 


egg 


gee 


ttc 


atg 


gac 


ggg acc aac 


tgc 


atg 


2461 


Gly Lys 


Pro 


Arg 


He 


Val Arg Ala 


Phe 


Met 


Asp Gly Thr Asn Cys Met 










785 










790 






795 








acg 


ctg 


gtg 


gac 


aag 


gtg 


ggc 


egg 


gee 


aac 


gac 


etc acc att 


gac 


tac 


2509 


Thr 


Leu 


Val 


Asp 


Lys 


Val 


Gly Arg 


Ala 


Asn 


Asp 


Leu Thr He 


Asp 


Tyr 








800 










805 








810 








get 


gac 


cag 


cgc 


etc 


tac 


tgg 


acc 


gac 


ctg 


gac 


acc aac atg 


ate 


gag 


2557 
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Ala Asp 


Gin 


Arg 


Leu 


Tyr Trp Thr Asp 


Leu 


Asp 


Thr 


Asn Met He Glu 




815 










820 








825 






teg tec 


aac 


atg 


ctg 


ggt 


cag gag 


egg 


gtc 


gtg 


att 


gee gac gat etc 


2605 


Ser Ser 


Asn 


Met 


Leu 


Gly Gin Glu Arg 


Val 


Val 


He 


Ala Asp Asp Leu 




830 








835 








840 




845 




ccg cac 


ccg 


ttc 


ggt 


ctg 


acg cag 


tac 


age 


gat 


tat 


ate tac tgg aca 


2653 


Pro His 


Pro 


Phe 


Gly 
850 


Leu 


Thr Gin 


Tyr 


Ser 
855 


Asp 


Tyr 


He Tyr Trp Thr 
860 




gac tgg 


aat 


ctg 


cac 


age 


att gag 


egg 


gee 


gac 


aag 


act age ggc egg 


2701 


Asp Trp 


Asn 


Leu 


His 


Ser 


He Glu Arg 


Ala 


Asp 


Lys 


Thr Ser Gly Arg 




< 




865 








870 








875 




aac cgc 


acc 


etc 


ate 


cag 


ggc cac 


ctg 


gac 


ttc 


gtg 


atg gac ate ctg 


2749 


Asn Arg 


Thr 


Leu 


He 


Gin Gly His 


Leu 


Asp 


Phe 


Val 


Met Asp He Leu 






880 








885 










890 




gtg ttc 


cac 


tec 


tec 


cgc cag gat 


ggc 


etc 


aat 


gac 


tgt atg cac aac 


2797 


Val Phe 


His 


Ser 


Ser 


Arg 


Gin Asp 


Gly 


Leu 


Asn 


Asp 


Cys Met His Asn 




895 










900 








905 






aac ggg 


cag 


tgt 


ggg 


cag 


ctg tgc 


ctt 


gee 


ate 


ccc 


ggc ggc cac cgc 


2845 


Asn Gly 


Gin 


Cys 


Gly 


Gin 


Leu Cys 


Leu 


Ala 


He 


Pro 


Gly Gly His Arg 




910 








915 








920 




925 




tgc ggc 


tgc 


gee 


tea 


cac 


tac acc 


ctg 


gac 


ccc 


age 


age cgc aac tgc 


2893 


Cys Gly 


Cys 


Ala 


Ser 
930 


His 


Tyr Thr 


Leu 


Asp 
935 


Pro 


Ser 


Ser Arg Asn Cys 
940 




age ccg 


ccc 


acc 


acc 


ttc 


ttg ctg 


ttc 


age 


cag 


aaa 


tct gee ate agt 


2941 


Ser Pro 


Pro 


Thr 
945 


Thr 


Phe 


Leu Leu 


Phe 
950 


Ser 


Gin 

• 


Lys 


Ser Ala He Ser 
955 




egg atg 


ate 


ccg 


gac 


gac 


cag cac 


age 


ccg 


gat 


etc 


ate ctg ccc ctg 


2989 


Arg Met 


He 
960 


Pro 


Asp 


Asp 


Gin His 
965 


Ser 


Pro 

• 


Asp 


Leu 


He Leu Pro Leu 
970 




cat gga 


ctg 


agg 


aac 


gtc 


aaa gee 


ate 


gac 


tat 


gac 


cca ctg gac aag 


3037 


His Gly 


Leu 


Arg 


Asn 


Val 


Lys Ala 


He 


Asp 


Tyr 


Asp 


Pro Leu Asp Lys 




975 










980 








985 






ttc ate 


tac 


tgg 


gtg 


gat 


ggg cgc 


cag 


aac 


ate 


aag 


cga gee aag gac 


3085 


Phe He 


Tyr 


Trp 


Val 


Asp Gly Arg Gin 


Asn 


lie 


Lys 


Arg Ala Lys Asp 




990 








995 








1000 


1005 




gac ggg 


acc 


cag 


ccc 


ttt 


gtt ttg 


acc 


tct 


ctg 


age 


caa ggc caa aac 


3133 


Asp Gly 


Thr 


Gin 


Pro 


Phe 


Val Leu 


Thr 


Ser 


Leu 


Ser 


Gin Gly Gin Asn 










1010 






1015 




1020 




cca gac 


agg 


cag 


ccc 


cac 


gac etc 


age 


ate 


gac 


ate 


tac age egg aca 


3181 


Pro Asp 


Arg 


Gin 


Pro 


His 


Asp Leu 


Ser 


He 


Asp 


He 


Tyr Ser Arg Thr 








1025 






1030 






1035 




ctg ttc 


tgg 


acg 


tgc 


gag 


gee acc 


aat 


acc 


ate 


aac 


gtc cac agg ctg 


3229 


Leu Phe 


Trp 


Thr 


Cys 


Glu 


Ala Thr 


Asn 


Thr 


He 


Asn 


Val His Arg Leu 






1040 






1045 








1050 




age ggg 


gaa 


gee 


atg 


ggg 


gtg gtg 


ctg 


cgt 


ggg 


gac 


cgc gac aag ccc 


3277 


Ser Gly 


Glu 


Ala 


Met 


Gly Val Val 


Leu 


Arg 


Gly 


Asp 


Arg Asp Lys Pro 




1055- 








1060 








1065 


■ 


agg gec 


ate 


gtc 


gtc 


aac 


gcg gag 


cga 


ggg 


tac 


ctg 


tac ttc acc aac 


3325 


Arg Ala 


He 


Val 


Val 


Asn 


Ala Glu 


Arg 


Gly 


Tyr 


Leu 


Tyr Phe Thr Asn 




1070 








1075 






1080 


1085 




atg cag 


gac 


egg 


gca 


gee 


aag ate 


gaa 


cgc 


gca 


gee 


ctg gac ggc acc 


3373 


Met Gin 


Asp 


Arg 


Ala 


Ala 


Lys He 


Glu 


Arg 


Ala 


Ala 


Leu Asp Gly Thr 










1090 






1095 




1100 




gag cgc 


gag 


gtc 


etc 


ttc 


acc acc 


ggc 


etc 


ate 


cgc 


cct gtg gee ctg 


3421 


Glu Arg 


Glu 


Val 


Leu 


Phe 


Thr Thr 


Gly 


Leu 


He Arg 


Pro Val Ala Leu 








1105 






1110 






1115 




gtg gtg 


gac 


aac 


aca 


ctg 


ggc aag 


ctg 


ttc 


tgg 


gtg 


gac gcg gac ctg 


3469 
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Val 


Val 


Asp Asn Thr 


Leu 


Gly 


Lys 


Leu 


Phe Trp Val Asp Ala Asp Leu 








1120 








1125 








1130 








aag 


cgc 


att 


gag 


age 


tgt 


gac 


ctg 


tea 


ggg 


gee 


aac 


cgc 


ctg 


acc 


ctg 


3517 


Lys 


Arg 


He 


Glu 


Ser 


Cys Asp 


Leu 


Ser 


Gly Ala 


Asn Arg 


Leu 


Thr 


Leu 






1135 








1140 








1145 










gag 


gac 


gec 


aac 


ate 


gtg 


cag 


cct 


ctg 


ggc 


ctg 


acc 


ate 


ctt 


ggc 


aag 


3565 


Glu 


Asp 


Ala 


Asn 


He 


Val 


Gin 


Pro 


Leu 


Gly Leu Thr 


He 


Leu Gly Lys 




1150 








1155 








1160 








1165 




cat 


etc 


tac 


tgg ate 


gac 


cgc 


cag 


cag 


cag 


atg 


ate 


gag 


cgt 


gtg 


gag 


3613 


His 


Leu 


Tyr 


Trp 


He 


Asp 


Arg 


Gin 


Gin 


Gin 


Met 


He 


Glu 


Arg Val 


Glu 












1170 








1175 








1180 




aag 


acc 


acc 


ggg 


gac 


aag 


egg 


act 


cgc 


ate 


cag 


ggc 


cgt 


gtc 


gee 


cac 


3661 


Lys 


Thr 


Thr 


Gly Asp 


Lys 


Arg 


Thr 


Arg 


He 


Gin 


Gly 


Arg 


Val 


Ala 


His 










1185 








1190 








1195 






etc 


act 


ggc 


ate 


cat 


gca 


gtg 


gag 


gaa 


gtc 


age 


ctg 


gag 


gag 


ttc 


tea 


3709 


Leu 


Thr 


Gly 


He 


His 


Ala 


Val 


Glu 


Glu 


Val 


Ser 


Leu 


Glu 


Glu 


Phe 


Ser 








1200 








1205 








1210 








gec 


cac 


cca 


tgt 


gee 


cgt 


gac 


aat 


ggt 


ggc 


tgc 


tec 


cac 


ate 


tgt 


att 


3757 


Ala 


His 


Pro 


Cys Ala 


Arg 


Asp 


Asn 


Gly 


Gly 


Cys 


Ser 


His 


He 


Cys 


He 






1215 








1220 








1225 










gec 


aag 


ggt 


gat 


ggg 


aca 


cca 


egg 


tgc 


tea 


tgc 


cca 


gtc 


cac 


etc 


gtg 


3805 


Ala 


Lys 


Gly 


Asp Gly 


Thr 


Pro 


Arg 


Cys 


Ser 


Cys 


Pro 


Val 


His 


Leu 


Val 




1230 








1235 








1240 








1245 




etc 


ctg 


cag 


aac 


ctg 


ctg 


acc 


tgt 


gga 


gag 


ccg 


ccc 


acc 


tgc 


tec 


ccg . 


3853 


Leu 


Leu 


Gin 


Asn 


Leu 


Leu 


Thr 


Cys 


Gly 


Glu 


Pro 


Pro 


Thr Cys 


Ser 


Pro 












1250 








1255 






* 


1260 




gac 


cag 


ttt 


gca 


tgt 


gec 


aca 


ggg 


gag 


ate 


gac 


tgt 


ate 


ccc 


ggg 


gee 


3901 


Asp 


Gin 


Phe 


Ala Cys 


Ala 


Thr 


Gly 


Glu 


He 


Asp 


Cys 


He 


Pro 


Gly 


Ala 










1265 








1270 








1275 






tgg 


cgc 


tgt 


gac 


ggc 


ttt 


ccc 


gag 


tgc 


gat 


gac 


cag 


age 


gac 


gag 


gag 


3949 


Trp 


Arg 


Cys 


Asp 


Gly 


Phe 


Pro 


Glu 


Cys 


Asp 


Asp 


Gin 


Ser Asp 


Glu 


Glu 








1280 








1285 








1290 








ggc 


tgc 


ccc 


gtg 


tgc 


tec 


gec 


gec 


cag 


ttc 


ccc 


tgc 


gcg 


egg 


ggt 


cag 


3997 


Gly 


Cys 


Pro 


Val 


Cys 


Ser 


Ala 


Ala 


Gin 


Phe 


Pro 


Cys 


Ala 


Arg 


Gly 


Gin 






1295 








1300 








1305 










tgt 


gtg 


gac 


ctg 


cgc 


ctg 


cgc 


tgc 


gac 


ggc 


gag 


gca . 


gac 


tgt 


cag 


gac 


4045 


Cys 


Val 


Asp 


Leu 


Arg 


Leu 


Arg 


Cys 


Asp 


Gly 


Glu 


Ala 


Asp 


Cys 


Gin 


Asp 




1310 








1315 








1320 








1325 




cgc 


tea 


gac 


gag 


gtg 


gac 


tgt 


gac 


gee 


ate 


tgc 


ctg 


ccc 


aac 


cag 


ttc 


4093 


Arg 


Ser 


Asp 


Glu 


Val 


Asp 


Cys 


Asp 


Ala 


He 


Cys 


Leu 


Pro 


Asn 


Gin 


Phe 












1330 








1335 








1340 




egg 


tgt 


gcg 


age 


ggc 


cag 


tgt 


gtc 


etc 


ate 


aaa 


cag 


cag 


tgc 


gac 


tec 


4141 


Arg 


Cys 


Ala 


Ser 


Gly 


Gin 


Cys 


Val 


Leu 


He 


Lys 


Gin 


Gin 


Cys 


Asp 


Ser 










1345 








1350 








1355 






ttc 


ccc 


gac 


tgt 


ate 


gac 


ggc 


tec 


gac 


gag 


etc 


atg 


tgt 


gaa 


ate 


acc 


4189 


Phe 


Pro 


Asp 


Cys 


He 


Asp 


Gly 


Ser 


Asp Glu Leu 


Met 


Cys 


Glu 


He 


Thr 








1360 








1365 


i 

• 






1370 






* 


aag 


ccg 


ccc 


tea 


gac 


gac 


age 


ccg 


gec 


cac 


age 


agt 


gee 


ate 


ggg 


ccc 


4237 


Lys 


Pro 


Pro 


Ser 


Asp 


Asp 


Ser 


Pro 


Ala 


His 


Ser 


Ser 


Ala 


He 


Gly 


Pro 






1375 








1380 








1385 










gtc 


att 


ggc 


ate 


ate 


etc 


tct 


etc 


ttc 


gtc 


atg 


ggt 


ggt 


gtc 


tat 


ttt 


4285 


Val 


lie Gly He 


He 


Leu 


Ser 


Leu 


Phe 


Val 


Met 


Gly Gly Val 


Tyr 


Phe 




1390 








1395 








1400 








1405 




gtg 


tgc 


cag 


cgc 


gtg 


gtg 


tgc 


cag 


cgc 


tat 


gcg 


ggg 


gee 


aac 


ggg 


ccc 


4333 


Val 


Cys 


Gin 


Arg 


Val 


Val 


Cys 


Gin 


Arg Tyr Ala Gly Ala Asn 


Gly 


Pro 












1410 








1415 








1420 




ttc 


ccg 


cac 


gag 


tat 


gtc 


age 


ggg 


acc 


ccg 


cac 


gtg 


ccc 


etc 


aat 


ttc 


4381 
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Phe 


Pro 


His 


Glu 


Tyr 


Val 


Ser Gly 


Thr 


Pro 


His 


Val 


Pro 


Leu 


Asn 


Phe 










1425 








1430 








1435 






ata 


gec 


ccg 


ggc 


ggt 


tec 


cag 


cat 


ggc 


ccc 


ttc 


aea 


ggc 


ate 


gca 


tgc 


4429 


lie 


Ala 


Pro Gly 


Gly 


Ser 


Gin 


His 


Gly 


Pro 


Phe 


Thr Gly He 


Ala 


Cys 








1440 








1445 








1450 




• 


gga 


aag 


tec 


atg 


atg 


age 


tec 


gtg 


age 


ctg 


atg ggg ggc 


egg 


ggc 


ggg 


4477 


Gly 


Lys 


Ser 


Met 


Met 


Ser 


Ser 


Val 


Ser 


Leu 


Met 


Gly Gly 


Arg 


Gly Gly 






1455 








1460 








1465 










gtg 


ccc 


etc 


tac 


gac 


egg 


aac 


cac 


gtc 


aca 


ggg 


gec 


teg 


tec 


age 


age 


4525 


Val 


Pro 


Leu 


Tyr 


Asp 


Arg Asn 


His 


Val 


Thr 


Gly Ala 


Ser 


Ser 


Ser 


Ser 




1470 








1475 








1480 








1485 




teg 


tec 


age 


acg 


aag 


gee 


acg 


ctg 


tac 


ccg 


ccg 


ate 


ctg 


aac 


ccg 


ccg 


4573 


Ser 


Ser 


Ser 


Thr 


Lys 


Ala 


Thr 


Leu 


Tyr 


Pro 


Pro 


He 


Leu 


Asn 


Pro 


Pro 












1490 








1495 








1500 




ccc 


tec 


ccg 


gec 


acg 


gac 


ccc 


tec 


ctg 


tac 


aac 


atg 


gac 


atg 


ttc 


tac 


4621 


Pro 


Ser 


Pro 


Ala 


Thr 


Asp 


Pro 


Ser 


Leu 


Tyr 


Asn 


Met 


Asp 


Met 


Phe 


Tyr 










1505 








1510 








1515 




■ 


tct 


tea 


aac 


att 


ccg 


gee 


act 


gcg 


aga 


ccg 


tac 


agg 


ccc 


tac 


ate 


att 


4669 


Ser 


Ser 


Asn 


He 


Pro 


Ala 


Thr 


Ala 


Arg 


Pro 


Tyr 


Arg 


Pro 


Tyr 


He 


He 








1520 








1525 








1530 








cga 


gga 


atg 


gcg 


ccc 


ccg 


acg 


acg 


ccc 


tgc 


age 


ace 


gac 


gtg 


tgt 


gac 


4717 


Arg 


Gly 


Met 


Ala 


Pro 


Pro 


Thr 


Thr 


Pro 


Cys 


Ser 


Thr 


Asp 


Val 


Cys 


Asp 






1535 








1540 








1545 










age 


gac 


tac 


age 


gee 


age 


cgc 


tgg 


aag 


gec 


age 


aag 


tac 


tac 


ctg 


gat 


4765 


Ser 


Asp 


Tyr 


Ser 


Ala 


Ser 


Arg 


Trp 


Lys 


Ala 


Ser 


Lys 


Tyr 


Tyr 


Leu 


Asp 




1550 








1555 








1560 








1565 




ttg 


aae 


teg 


gac 


tea 


gac ccc 


tat 


cca 


ccc 


cca 


ccc 


acg 


ccc 


cac 


age 


4813 


Leu 


Asn 


Ser 


Asp 


Ser 


Asp 


Pro 


Tyr 


Pro 


Pro 


Pro 


Pro 


Thr 


Pro 


His 


Ser 












1570 








1575 








1580 




cag 


tac 


ctg 


teg 


gcg 


gag 


gac 


age 


tgc 


ccg 


ccc 


teg 


ccc 


gee 


ace 


gag 


4861 


Gin 


Tyr 


Leu 


Ser 


Ala 


Glu Asp Ser 


Cys 


Pro 


Pro 


Ser 


Pro 


Ala 


Thr 


Glu 










1585 








1590 








1595 






agg 


age 


tac 


ttc 


cat 


etc 


ttc 


ccg 


ccc 


cct 


ccg 


tec 


ccc 


tgc 


acg 


gac 


4909 


Arg 


Ser 


Tyr 


Phe 


His 


Leu 


Phe 


Pro 


Pro 


Pro 


Pro 


Ser 


Pro 


Cys 


Thr 


Asp 





1600 1605 1610 



tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4 965 
Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 

taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 

tggagaaata tttataaact taattttgta aaaca 5120 



<210> 3 
<211> 1615 
<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Glu Ala Ala Pro Pro Gly Pro 

i 5 
Leu Leu Leu Leu Ala Leu Cys Gly 

20 

Pro Leu Leu Leu Phe Ala Asn Arg 
35 40 
Gly Gly Val Lys Leu Glu Ser Thr 



Pro Trp Pro Leu Leu Leu Leu Leu 

10 15 
Cys Pro Ala Pro Ala Ala Ala' Ser 

25 30 
Arg Asp Val Arg Leu Val Asp Ala 

45 

He Val Val Ser Gly Leu Glu Asp 
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50 










55 






60 


Ala 


Ala 


Ala 


Val 


Asp 


Phe 


Gin 


Phe 


Ser 


Lys Gly Ala Val Tyr Trp Thr 


65 










70 








75 80 


Asp 


Val 


Ser 


Glu 


Glu 


Ala 


He 


Lys 


Gin 


Thr Tyr Leu Asn Gin Thr Gly 










85 










90 95 


Ala 


Ala 


Val 


Gin 


Asn 


Val 


Val 


He 


Ser 


Gly Leu Val Ser Pro Asp Gly 








100 










105 


110 


Leu 


Ala 


Cys 


Asp 


Trp 


Val 


Gly 


Lys 


Lys 


Leu Tyr Trp Thr Asp Ser Glu 






115 










120 




125 


Thr 


Asn 


Arg 


He 


Glu 


Val 


Ala 


Asn 


Leu 


Asn Gly Thr Ser Arg Lys Val 




130 










135 






140 


Leu 


Phe 


Trp 


Gin 


Asp 


Leu 


Asp 


Gin 


Pro 


Lys Ala He Ala Leu Asp Pro 


145 










150 








155 160 


Ala 


His 


Gly 


Tyr 


Met 


Tyr 


Trp 


Thr 


Asp 


Trp Gly Glu Thr Pro Arg He 










165 










170 175 


Glu 


Arg 


Ala 


Gly 


Met 


Asp 


Gly 


Ser 


Thr 


Arg Lys He He Val Asp Ser 








180 










185 


190 


Asp 


He 


Tyr 


Trp 


Pro 


Asn 


Gly 


Leu 


Thr 


He Asp Leu Glu Glu Gin Lys 






195 










200 




205 


Leu 


Tyr 


Trp 


Ala 


Asp 


Ala 


Lys 


Leu 


Ser 


Phe lie His Arg Ala Asn Leu 




210 










215 






220 


Asp 


Gly 


Ser 


Phe 


Arg 


Gin 


Lys 


Val 


Val 


Glu Gly Ser Leu Thr His Pro 


225 










230 








235 240 


Phe 


Ala 


Leu 


Thr 


Leu 


Ser Gly 


Asp 


Thr 


Leu Tyr Trp Thr Asp Trp Gin 










245 










250 255 


Thr 


Arg 


Ser 


He 


His 


Ala Cys 


Asn 


Lys 


Arg Thr Gly Gly Lys Arg Lys 






• 


260 










265 


270 


Glu 


He 


Leu 


Ser 


Ala 


Leu 


Tyr 


Ser 


Pro 


Met Asp He Gin Val Leu Ser 






275 










280 




285 


Gin 


Glu 


Arg 


Gin 


Pro 


Phe 


Phe 


His 


Thr 


Arg Cys Glu Glu Asp Asn Gly 




290 










295 






300 


Gly 


Trp 


Ser 


His 


Leu 


Cys 


Leu 


Leu 


Ser 


Pro Ser Glu Pro Phe Tyr Thr 


305 










310 








315 320 


Cys 


Ala 


Cys 


Pro 


Thr 


Gly Val 


Gin 


Met 


Gin Asp Asn Gly Arg Thr Cys 










325 










330 335 


Lys 


Ala 


Gly 


Ala 


Glu 


Glu 


Val 


Leu 


Leu 


Leu Ala Arg Arg Thr Asp Leu 








340 










345 


350 


Arg 


Arg 


He 


Ser 


Leu 


Asp 


Thr 


Pro 


Asp 


Phe Thr Asp He Val Leu Gin 






355 










360 




365 


Val 


Asp 


Asp 


He 


Arg 


His 


Ala 


He 


Ala 


He Asp Tyr Asp Pro Leu Glu 




370 










375 






380 


Gly 


Tyr 


Val 


Tyr 


Trp 


Thr 


Asp 


Asp 


Glu 


Val Arg Ala He Arg Arg Ala 


385 










390 








395 400 


Tyr 


Leu 


Asp 


Gly 


Ser 


Gly Ala 


Gin 


Thr 


Leu Val Asn Thr Glu He Asn 










405 










410 415 


Asp 


Pro 


Asp 


Gly 


He 


Ala 


Val 


Asp 


Trp 


Val Ala Arg Asn Leu Tyr Trp 








420 










425 


430 


Thr 


Asp 


Thr 


Gly 


Thr 


Asp Arg 


He 


Glu 


Val Thr Arg Leu Asn Gly Thr 






435 










440 




445 


Ser 


Arg 


Lys 


He 


Leu 


Val 


Ser 


Glu 


Asp 


Leu Asp Glu Pro Arg Ala He 




450 










455 






4 60 


Ala 


Leu 


His 


Pro 


Val 


Met 


Gly 


Leu 


Met 


Tyr Trp Thr Asp Trp Gly Glu 


465 










470 








475 480 


Asn 


Pro 


Lys 


He 


Glu 


Cys 


Ala 


Asn 


Leu 


Asp Gly Gin Glu Arg Arg Val 










485 










490 495 


Leu 


Val 


Asn 


Ala 


Ser 


Leu Gly 


Trp 


Pro 


Asn Gly Leu Ala Leu Asp Leu 








500 










505 


510 
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Gin Glu 


Gly 


Lys 




515 




Val He 


Asn 


Val 


530 






Leu Pro 


His 


He 


545 






Thr Asp 


Trp 


Gin 


Ser Arg 


Asp 


Val 






580 


Ala Val 


Aqn 
no 1 1 


Val 

v a x 




595 




Asn Gly 


Gly 


Cys 


610 






Cys Gly 


Cys 


Pro 


625 






He Val 


Pro 


Glu 


Arg He 


Ser 


Leu 






660 


Glv Val 


Lvs 


Glu 




675 




He Tyr 


Trp 


Thr 


690 






Asn Gly 


Ser 


Ser 


705 


■ • 




Glu Gly 


Met 


Ala 


Thr Gly 


Thr 


Asn 






740 


Gin Val 


Leu 


Val 




755 




Asp Pro 


Thr 


Lys 


770 






Arg He 


Val 


Arg 


785 






Asp Lys 


Val 


Gly 


Arg Leu 


Tyr 


Trp 






820 


Met Leu 


Glv 
\j j 


Gin 




835 




Phe Gly 


Leu 


Thr 


850 






Leu His 


Ser 


He 


865 






Leu He 


Gin 


Gly 


Ser Ser 


Arg 


Gin 






900 


Cys Gly 


Gin 


Leu 


915 




Ala Ser 


His 


Tyr 


930 






Thr Thr 


Phe 


Leu 


945 






Pro Asp 


Asp 


Gin 



Leu Tyr Trp Gly 

520 

Asp Gly Thr Lys 
535 

Phe Gly Phe Thr 
550 

Arg Arg Ser He 
565 

He Tie Asp Gin 

Ala Lys Val Val 

600 

Ser His Leu Cys 
615 

He Gly Leu Glu 
630 

Ala Phe Leu Val 
645 

Glu Thr Asn Asn 

Ala Ser Ala Leu 

680 

Asp Val Ser Leu 
695 

Val Glu His Val 
710 

Val Asp Trp Met 
725 

Arg He Glu Val 

Trp Arg Asp Leu 

760 

Gly Tyr He Tyr 
775 

Ala Phe Met Asp 
790 

Arg Ala Asn Asp 
805 

Thr Asp Leu Asp 

Glu Arg Val Val 

840 

Gin Tyr Ser Asp 
855 

Glu Arg Ala Asp 
870 

His Leu Asp Phe 
885 

Asp Gly Leu Asn 

Cys Leu Ala He 

920 

Thr Leu Asp Pro 
935 

Leu Phe Ser Gin 
950 

His Ser Pro Asp 



Asp 


Ala 


Lys 


Thr 


Arg 


Arg 


Thr 


Leu 








540 




Leu Gly 


Asd 






555 




Glu 


Arg 


Val 


His 




570 






Leu 


Pro Asp 


Leu 


585 








Gly 


Thr 


Asn 


Pro 


Phe 


Phe 


Thr 


Pro 








620 


Leu 

<i-J 


Leu 


Ser 


Asd 






635 




Phe 


Thr 


Ser 


Arg 




650 




• 


Asn 


Asp Val 


Ala 


665 








Asp 


Phe Asp 


Val 


Lys 


Asn 


He 


Ser 








700 


Val 


Glu 


Phe 


Glv 






715 




Gly 


Lys Asn 


Leu 




730 






Ala 


Arg 


Leu 


Asp 


745 








Asp 


Asn 


Pro 


Arg 


Trp 


Thr 


Glu 


Trp 








780 


Glv 


Thr 


Asn 


Cvs 






795 




Leu 


Thr 


He 


Asp 




810 






Thr 


Asn 


Met 


He 


825 








He 


Ala 


Asp 


Asp 


Tyr 


He 


Tyr 


Trp 








860 


Lvs 


Thr 


Ser 


Glv 






875 




Val 


Met 


Asp 


He 




890 






Asp 


Cys 


Met 


His 


905 








Pro 


Gly Gly 


His 


Ser 


Ser 


Arg 


Asn 








940 


Lys 


Ser 


Ala 


He 






955 




Leu 


He 


Leu 


Pro 



Asp 


Lys 


He 


Glu 


525 








Leu 


Glu 


Asp 


Lys 


Phe 


He 


Tyr 


Trp 








560 


xj jr 9 


Val 

V ox 


T.uq 


Al a 






575 




Met 


Gly 


Leu 


Lys 




590 






Cys 


Ala 


Asp 


Arg 


605 








His 


Ala 


Thr 


Arg 


Met 


Lys 


Thr 


Cys 








640 


Ala 


Ala 


lie 


nxs 






655 




He 


Pro 


Leu 


Thr 




670 






Ser 


Asn 


Asn 


His 


685 








Arg 


Ala 


Phe 


Met 


Leu 


Asp 


Tyr 


Pro 








720 


Tvr 




Ala 

nx c* 








735 




Gly 


Gin 


Phe 


Arg 




750 






Ser 


Leu 


Ala 


Leu 


765 








Gly 


Gly 


Lys 


Pro 


Met 


Thr 


Leu 


Val 








800 


Tvr 
x yx 


Ala 


Asn 


Gin 






815 




Glu 


Ser 


Ser 


Asn 




830 






Leu 


Pro 


His 


Pro 


845 








Thr 


Asp 


Trp 


Asn 


Arg 


Asn 


Arg 


Thr 








880 


XjC U 


Val 

V CI X 




His 
nxD 






895 




Asn 


Asn 


Gly 


Gin 




910 






Arg 


Cys 


Gly 


Cys 


925 








Cys 


Ser 


Pro 


Pro 


Ser 


Arg 


Met 


He 








960 


Leu 


His 


Gly 


Leu 
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965 970 975 

Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys Phe He Tyr 

980 985 990 

Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp Asp Gly Thr 

995 1000 . 1005 

Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro Asp Arg 

1010 1015 1020 

Gin Pro His Asp Leu Ser He Asp He Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser Gly Glu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala He 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235. 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val He Gly 

1380 1385 1390 

He lie Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 
1410 1415 1420 
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Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe lie Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 " 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 

1605 1610 ~ 1615 



<210> 4 
<211> 1615 
<212> PRT 

<213> Homo sapiens 



<400> 4 



Met 


Glu 


Ala 


Ala 


Pro 


1 








5 


Leu 


Leu 


Leu 


Leu 


Ala 








20 




Pro 


Leu 


Leu 


Leu 


Phe 






35 






Gly 


Gly 


Val 


Lys 


Leu 




50 








Ala 


Ala 


Ala 


Val 


Asp 


65 










Asp 


Val 


Ser 


Glu 


Glu 










85 


Ala 


Ala 


Val 


Gin 


Asn 








100 




Leu 


Ala 


Cys Asp 


Trp 






115 






Thr 


Asn 


Arg 


He 


Glu 


• > 


130 








Leu 


Phe 


Trp Gin 


Asp 


145 










Ala 


His 


Gly Tyr 


Met 










165 


Glu 


Arg 


Ala 


Gly 


Met 








180 





Pro Gly Pro Pro Trp 

10 

Leu Cys Gly Cys Pro 

25 

Ala Asn Arg Arg Asp 
40 

Glu Ser Thr He Val 
55 

Phe Gin Phe Ser Lys 
70 

Ala He Lys Gin Thr 

90 

Val Val He Ser Gly 

105 

Val Gly Lys Lys Leu 
120 

Val Ala Asn Leu Asn 
135 

Leu Asp Gin Pro Lys 
150 

Tyr Trp Thr Asp Trp 

170 

Asp Gly Ser Thr Arg 

185 



Pro 


Leu Leu Leu 


Leu 


Leu 






15 




Ala 


Pro Ala Ala 


Ala 


Ser 




30 






Val 


Arg Leu Val 


Asp 


Ala 




45 






Val 


Ser Gly Leu 


Glu 


Asp 




60 






Gly 


Ala Val Tyr 


Trp 


Thr 


75 






80 


Tyr 


Leu Asn Gin 


Thr 


Gly 






95 




Leu 


Val Ser Pro 


Asp 


Gly 




110 






Tyr 


Trp Thr Asp 


Ser 


Glu 




125 






Gly 


Thr Ser Arg 


Lys 


Val 




140 






Ala 


He Ala Leu 


Asp 


Pro 


155 






160 


Val 


Glu Thr Pro 


Arg 


He 






175 




Lys 


He He Val 


Asp 


Ser 




190 
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Asp iie 


lyr 


Trp 




195 




Leu Tyr 


Trp 


Ala 


210 






Asp Gly 


Ser 


Phe 


225 






Phe Ala 


Leu 


Thr 


Thr Arg 


Ser 


He 






260 


vjiu lie 


lieu 


oer 




275 




Gin Glu 


Arg 


Gin 


290 






Gly Trp 


Ser 


His 


305 






Cys Ala 


Cys 


Pro 


. 

Lys Ala 


Gly 


Ala 






340 


ni y MX y 


lie 


Car 

oer 




355 




Val Asp 


Asp 


He 


370 






Gly Tyr 


Val 


Tyr 


385 


■ 




Tyr Leu 


Asp 


Gly 


Asp Pro 


Asp 


Gly 






420 


inr Asp 


inr 


biy 




435 




Ser Arg 


Lys 


He 


450 






Ala Leu 


His 


Pro 


465 






Asn Pro 


Lys 


He 


Leu Val 


Asn 


Ala 






500 


Gin Glu 

uJ.il WXU 


Glv 






515 




Val He 


Asn 


Val 


530 






Leu Pro 


His 


He 


545 






Thr Asp 


Trp 


Gin 


Ser Arg 


Asp 


Val 






580 


Ala Val 

• • 


Asn 


Val 




595 




Asn Gly 


Gly 


Cys 


610 






Cys Gly 


Cys 


Pro 


625 






He Val 


Pro 


Glu 



Pro Asn Gly Leu 

200 

Asp Ala Lys Leu 
215 

Arg Gin Lys Val 
230 

Leu Ser Gly Asp 
245 

His Ala Cys Asn 

Ala Leu Tyr Ser 

280 

Pro Phe Phe His 
295 

Leu Cys Leu Leu 
310 

Thr Gly Val Gin 
325 

Glu Glu Val Leu 

Leu Asp Thr Pro 

360 

Arg His Ala He 
375 

Trp Thr Asp Asp 
390 

Ser Gly Ala Gin 
405 

He Ala Val Asp 

Thr Asp Arg He 

440 

Leu Val Ser Glu 
455 

Val Met Gly Leu 
470 

Glu Cys Ala Asn 
485 

Ser Leu Gly Trp 

Leu Tyr Trp Gly 

520 

Asp Gly Thr Lys 
535 

Phe Gly Phe Thr 
550 

Arg Arg Ser He 
565 

He He Asp Gin 

Ala Lys Val Val 

600 

Ser His Leu Cys 
615 

He Gly Leu Glu 

. 630 
Ala Phe Leu Val 



Thr 


He Asp Leu 


Ser 


Phe He 


His 






220 


vai 


Glu Gly Ser 




235 




Thr 


Leu Tyr 


Trp 




250 




Lys 


Arg Thr Gly 


265 






Pro 


Met Asp 


He 


Thr 


Arg Cys 


Glu 






300 


O v 


Pro Ser 


Glu 




J id 




Met 


Gin Asp 


Asn 




330 




Leu 


Leu Ala 


Arg 


345 






Asp 


Phe Thr 


Asp 


Ala 


He Asp 


Tyr 






380 


bill 


Val Arg 


Ala 








Thr 


Leu Val 


Asn 




410 




Trp 


Val Ala 


Arg 


425 






Glu 


Val Thr 


Arg 


Asp 


Leu Asp 


Glu 






460 


Met 


Tyr Trp 


Thr 




a 7 




Leu 


Asp Gly 


Gin 




4 90 




Pro 


Asn Gly 


Leu 


505 






Asp 


Ala Lys 


Thr 


Arg 


Arg Thr 


Leu 






540 


lieu 


Leu Gly 


Asp 




555 




Glu 


Arg Val 


His 




570 




Leu 


Pro Asp 


Leu 


585 






Gly 


Thr Asn 


Pro 


Phe 


Phe' Thr 


Pro 




. • 


620 


Leu 


Leu Ser 


Asp 




635 




Phe 


Thr Ser 


Arg 



Glu Glu Gin Lys 
205 

Arg Ala Asn Leu 

Leu Thr His Pro 

240 

Thr Asp Trp Gin 
255 

Gly Lys Arg Lys 
270 

Gin Val Leu Ser 
285 

Glu Asp Asn Gly 

Pro Phe Tyr Thr 

320 

Gly Arg Thr Cys 
335 

Arg Thr Asp Leu 
350 

He Val Leu Gin 
365 

Asp Pro Leu Glu 

He Arg Arg Ala 

400 

Thr Glu He Asn 
415 

Asn Leu Tyr Trp 
4 30 

Leu Asn Gly Thr 
445 

Pro Arg Ala He 

Asp Trp Gly Glu 

480 

Glu Arg Arg Val 
495 

Ala Leu Asp Leu 
510 

Asp Lys He Glu 
525 

Leu Glu Asp Lys 

Phe He Tyr Trp 

560 

Lys Val Lys Ala 
575 

Met Gly Leu Lys 
590 

Cys Ala Asp Arg 
605 

His Ala Thr Arg 

Met Lys Thr Cys 

640 

Ala Ala He His 
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645 






650 








655 




Arg 


He 


Ser 


Leu 


Glu Thr Asn 


Asn 


Asn 


Asp 


Val 


Ala 


He 


Pro Leu 


Thr 








660 






665 










670 




Gly 


Val 


Lys 


Glu 


Ala Ser Ala 


Leu 


Asp 


Phe 


Asp 


Val 


Ser 


Asn Asn 


His 






675 






680 










685 






He 


Tyr 


Trp 


Thr 


Asp Val Ser 


Leu 


Lys 


Asn 


He 


Ser Arg 


Ala Phe 


Met 




690 






695 










700 








Asn 


Gly 


Ser 


Ser 


Val Glu His 


Val 


Val 


Glu 


Phe 


Gly Leu 


Asp Tyr 


Pro 


705 








710 








715 






720 


Glu 


Gly 


Met 


Ala 


Val Asp Trp 


Met 


Gly 


Lys 


Asn 


Leu Tyr 


Trp Ala 


Asp 










725 






730 








735 




Thr 


Gly 


Thr 


Asn 


Arg He Glu 


Val 


Ala 


Arg 


Leu 


Asp Gly 


Gin Phe 


Arg 








740 






745 










750 




Gin 


Val 


Leu 


Val 


Trp Arg Asp 


Leu 


Asp 


Asn 


Pro 


Arg 


Ser 


Leu Ala 


Leu 






755 






7 60 










765 






Asp 


Pro 


Thr 


Lys 


Gly Tyr He 


Tyr 


Trp 


Thr 


Glu 


Trp Gly 


Gly Lys 


Pro 




770 






775 










780 








Arg 


He 


Val 


Arg 


Ala Phe Met 


Asp Gly 


Thr 


Asn 


Cys 


Met 


Thr Leu 


Val 


785 








7 90 








795 








800 


Asp 


Lys 


Val 


Gly 


Arg Ala Asn 


Asp 


Leu 


Thr 


He 


Asp Tyr 


Ala Asp 


Gin 










805 






810 








815 




Arg 


Leu 


Tyr 


Trp 


Thr Asp Leu 


Asp 


Thr 


Asn 


Met 


He 


Glu 


Ser Ser 


Asn 








820 






825 










830 




Met 


Leu 


Gly 


Gin 


Glu Arg Val 


Val 


He 


Ala 


Asp 


Asp 


Leu 


Pro His 


Pro 






835 






840 










845 






Phe 


Gly 


Leu 


Thr 


Gin Tyr Ser 


Asp Tyr 


lie 


Tyr 


Trp Thr 


Asp Trp 


Asn 




850 






855 










860 








Leu 


His 


Ser 


He 


Glu Arg Ala 


Asp 


Lys 


Thr 


Ser 


Gly Arg 


Asn Arg 


Thr 


865 








870 








875 






- 


880 


Leu 


He 


Gin 


Gly 


His Leu Asp 


Phe 


Val 


Met 


Asp 


He 


Leu 


Val Phe 


His 










885 






890 








895 




Ser 


Ser 


Arg 


Gin 


Asp Gly Leu 


Asn 


Asp 


Cys 


Met 


His 


Asn 


Asn Gly 


Gin 








900 






905 










910 




Cys 


Gly 


Gin 


Leu 


Cys Leu Ala 


He 


Pro 


Gly 


Gly 


His 


Arg 


Cys Gly 


Cys 






915 






920 










925 






Ala 


Ser 


His 


Tyr 


Thr Leu Asp 


Pro 


Ser 


Ser 


Arg 


Asn 


Cys 


Ser Pro 


Pro 




930 






935 










940 








Thr 


Thr 


Phe 


Leu 


Leu Phe Ser 


Gin 


Lys 


Ser 


Ala 


He 


Ser 


Arg Met 


lie 


945 








950 








955 








960 


Pro 


Asp 


Asp 


Gin 


His Ser Pro 


Asp 


Leu 


lie 


Leu 


Pro 


Leu 


His Gly 


Leu 










965 






970 








975 




Arg 


Asn 


Val 


Lys 


Ala He Asp 


Tyr 


Asp 


Pro 


Leu 


Asp 


Lys 


Phe He 


Tyr 








980 






985 










990 




Trp 


Val 


Asp 


Gly 


Arg Gin Asn 


He 


Lys 


Arg 


Ala 


Lys 


Asp 


Asp Gly 


Thr 






995 






1000 








1005 




Gin 


Pro 


Phe 


Val 


Leu Thr Ser 


Leu 


Ser 


Gin 


Gly 


Gin 


Asn 


Pro Asp 


Arg 




1010 




1015 








1020 




Gin 


Pro 


His 


Asp 


Leu Ser lie Asp 


He 


Tyr 


Ser 


Arg 


Thr 


Leu Phe 


Trp 


1025 






1030 








1035 






1040 


Thr 


Cys 


Glu 


Ala 


Thr Asn Thr 


He 


Asn 


Val 


His 


Arg 


Leu 


Ser Gly 


Glu 










1045 






1050 






1055 


Ala 


Met Gly 


Val 


Val Leu Arg Gly Asp 


Arg Asp 


Lys 


Pro Arg Ala 


lie 








1060 




1065 








1070 




Val 


Val 


Asn 


Ala 


Glu Arg Gly Tyr 


Leu 


Tyr 


Phe 


Thr 


Asn 


Met Gin Asp 






1075 




1080 








1085 




Arg Ala 


Ala 


Lys 


He Glu Arg 


Ala 


Ala 


Leu 


Asp 


Gly 


Thr 


Glu Arg 


Glu 



1090 1095 1100 
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Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 H35 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 H50 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 H65 

Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val He Gly 

1380 1385 1390 

He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 • 1520 . 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 
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1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 160 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 



<210> 5 

<2U> 3096 

<212> DNA 

<213> Homo sapiens 

.<400> 5 

catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60 

tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120 

cagctccgtc ccctaatatt catgcgcgga tggcccattc cgtggtgaaa attcccttct 180 

actctgctaa tctgctgttc tctctccctc ccgtcgggtt ctgctcctgc cacgttctcc 240 

cctctcccca ccaaaggctg ggttttcttt gtcagggctc ctttcccctt tggaagaagg 300 

ggggctgtat ggccttggtg cgaggccctc cagtgacagg atcccccatc acccagagtt 360 

ccacaggccc tggtagggag gagggggagc agaagaggag gtgccatctt tgcctgctgg 420 

ggaagggcag gggccaccca cacagagctc tcccatttgc tgtggaccct ggggccactg 480 

cccagttcct tccaaaggaa agccagctcc ccaggtggtg ggagagtgat atggcttcct 540 

cttaaactta gggaattgag tgtgtggttg cttctaagtg ccttagaagc cgggagcggc 600 

tcctggaaag agcctgcctg ccacagcggg ccttaccctg gctgtgccca cagatgtccc 660 

tggggcctgc cgctcctgcc cggctctcct ggcctccccc ggtgtgggtt gggaaaagca 720 

cagcaaatta aaaaacacct ccatctctgg cctttgaaga atgcatctga acagccgaga 780 

gtgtaaaccg tggtgaaatg tggtctttcc agtttgggga gaagcagggc agagctgggg 840 

cttttgtacc cagggtttcc aagagctcct gcctccctcg gctgggctgg ccagggcccc 900 

ccgctgggac ctccagctgt aatagggaag gttttactgg gttgctggcc actgtggact 960 

gcccctaagg gcaggtatgc ctgcctttac ccgggttccc ctcctgcctg gaagatacag 1020 

cccatgggag gcctgttgtc tgtgggatcc tccagcatca gagacactgg ggccagcgtc 1080 

tgcctggtga ggtgcaggcc tggcaggccc ggtcccccac ctgcttgagc acccacggtg 1140 

gtgggggctc gctgcctccc gagacaatct atgtcattgt tgtccaagga agctaattta 1200 

gagtagaaag ttccgtgtcc agtcccactc tgtgcgtgtg ttagcagggg actctcgggc 1260 

cggagctggg tccaccctgg tagggggact tcatggggcc tgggcgacag cactgtgtat 1320 

ttgtgtgtgt gtgtgtttgt gtgtgtgtgt gtctgaggag gtggaccagt ttctcaaaag 1380 

gcctgtgacc ccaagaacca aggaatttca gcctgggtgg atcacacctt cactggtgag 1440 

tgggacaagc tgggggccct cgccacagga gcagccaggg catggggcac agttggcctc 1500 

attcacaaaa tgggagtata agtgatccct gctctggcgg ccaggacgat gagtgggaac 1560 

acaccgtgtg ggggctgcct ggcctgggtg tgccgcgggt gtccttgttg gtgatggttc 1620 

cacctgcttg tgccaccagt gccctctggg tctcacacac aactctcttc ccagcgaagg 1680 

cccctcctgc cctcaggcct cagtgctgct tccgtctcgg aaggccccag gagctcctgc 1740 

atcctgggcg tgattcctgt gtgcctgcag accccctcgc ggctgccatc tcatcctttg 1800 • 

gtgcacctgt tggccagacc tcctggtagc gggtgctgca ctcccctgaa tgtgccgggg 1860 

cctgggggca gggacctggg ctcctccctc actgagtgga gggaactcag tgtcttggag 1920 

ttggggtgcc tgcaggctgg gtggtgcagg tgaaatgcag acctctcagc tggtgttcca 1980 

gagcagctgc cttcccccgc ccgagggact tcacccgcag cccagtcagg ggtggcgcct 2040 

gggtgcatcg cccgcaggct gggtaggggt ggagcctggg tggccctgcc tgtgagctgc 2100 

atagttgtcg cctttgaccc tgagttttct tcgttatctg tttggacctg tttggggcag 2160 

gcaggggatg agatctgaag ataaatgcct tagctgtgac catctccttt tgtgagaggt 2220 

caatgtccag ttccgctgca gttataacat cccatttttt gatttctttt tattttttcc 2280 

tttttctttt tgagatggag tctcgctctg tcacccaggc tggagtgcaa tggggtgacc 2340 

tcagctcact gcaacctcca cttctcgggt tcaagtgatt ctcctgcctc agcctcctga 2400 
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ctagcagggg 

gcaaggtttc 
gcctcggcct 
ctttaattcc 
cctggtccca 
aaattgaatc 
cttaagtttc 
tttttttttt 
gatctcggtt 
cttagcagct 
tggaggcagg 
gggcccgaag 



ttacaggcgt 
gtcatgttgg 
cccaaagtgc 
cacctgaaac 
ccattctgct 
ataaagtatt 
atccatgttg 
ttttgcagac 
cactgcaaca 
ggaactacag 
gtttcaccat 
gacccgggca 



gagccaccac 
ccaggctggt 
tgagatgaca 
ttgccgcctt 
ttctgtctca 
tgtcttttta 
tagcatgtgt 
agagtctcgc 
tctgcctcct 
gcgcgtgcca 
cttggccagg 
aagcgtggag 



gcccagctaa 
ctcaaactcc 
ggtgtgagcc 
aagcaggtcc 
atgaatttgc 
tatctggctt 
cagaatctct 
tctgtcatct 
gggtccaagc 
ccatgccttg 
ctggtctcga 
gggagg 



tttttgtatt 
tggccttaag 
accgtgcccg 
ccagtctccc 
ctaccgtaag 
atttcactta 
ctcttttttt 
agactggagt 
aattctcctg 
ctaatttttg 
attcctggtc 



tttagtagag 
tgatctgccc 
gcccagaact 
tcccctagtc 
tacctcatat 
gcataacatt 
tttttttttt 
tcagtggcac 
cctcagcctc 
tatttttatg 
ttcaccacgg 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3096 



<210> 6 
<211> 26928 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> (12044), (12489), (2 64 33), (264 34), (26435), (2 6436), (26439), (26441) 
<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 6 

gaagaccaag 

ggccccggac 

cggctgggag 

tagttatgaa 

taggctgaca 

agcccgccag 

gacgtgctgg 

tccggaacca 

ccttggcggg 

agcaatggcc 

gtgaacgtgt 

gcctctgccc 

ttaaaacaac 

tgtgctgcag 

tgagcagccg 

ggggcagagg 

gagctggtgg 

cggggaagat 

cagcaaaaat 

tctattttca 

acctgtcacc 

ggcagccagg 

ttcggcacct 

ccactccctt 

acacaccagg 

tttttttttt 

gtgtgatcgt 

cctctcaagt 

ggctggagtg 

attctcctgc 



ggcacacagc 
acttgttctc 
tggaggtggg 
ttatgagaca 
gggagaaagt 
ggcctcttgc 
taggagaaga 
gcaagcccaa 
tctaagtaga 
accttgcagg 
gtgttcctcc 
tcatggggtt 
aaaagcccca 
tctttgggga 
tccggtggga 
gcagcctggt 
tagaatgtac 
gaatcgagaa 
attttccctt 
agatagaata 
cactcatgcg 
gaagctccag 
gcagggcctg 
cccacatctc 
tgcactcctg 
tttttttttg 
agctcattgc 
agttagcttt 
cagtggggca 
ctcagcctcc 



gaggcagttt 
acctgtggag 
tttgaggttt 
cgaagactgc 
cccgccaggc 
ccaccggtgt 
atctcaccca 
gttggaactc 
acctctccgc 
ttcagagagg 
agatttgtga 
tatattctag 
ttgacaggcc 
aacgggctta 
gggcagttca 
cgctggattc 
aggagaggcc 
tgcaccccac 
cttccaccct 
gacccagact 
tccccacttg 
ggctcgtgag 
ctccaaccac 
accctgctcc 
cctcagggcc 
agacagggtc 
aacctcaaac 
tgttgttttg 
atcttggctc 
caagtagctg 



cagggcgggc 
ggcagagaag 
tactgtaaac 
gaaacagaca 
tcccagacgc 
cagctcagcc 
ggcacagcct 
ccagtctgct 
gcactctttc 
gcttgcagta 
cagctttgcc 
gaagaccaac 
gtgaagaatg 
cggaaagaag 
ggaagagcag 
gggggaggaa 
agccagggcc 
gagcaatggg 
gcatccagct 
cccagctctt 
cagcctcgac 
ggctgccatc 
acttcttcct 
agccacagtg 
tttgtgcttg 
tcactctgtg 
tcctgggctc 
ttttgagatg 
accacaacct 
ggattacagg 



agcctggggc 
ggaacaggga 
taaatgtgta 
cactcctcta 
cacctttgag 
actgaaccct 
ggaaggggca 
actttctaga 
ctcctttgta 
cctcacagaa 
aggctggagt 
aaaaacaaga 
ccatgaaaaa 
gacacttgag 
acatccactg 
ccacatcagg 
agctcatgtc 
aagccagtct 
ctaccagcac 
cttacacttc 
ccccttccac 
tcaggaaaga 
tgacctctca 
gtgtctctgt 
ctgttctctg 
gcccaggctg 
aagcaatcct 
ggatctcact 
ctgcctccca 
catgtgccac 



cccacggggc 
gagaagtggc 
ccctctacct 
aaagtgcctc 
tccttcaaca 
ccaggaagaa 
cagaaggggc 
acgactgtgc 
aagtggggac 
ctgagtgccc 
caggctgaac 
agacggaaaa 
tgaatggcgt 
ctgctaccaa 
aggaggcgct 
ccatgagctg 
agacctcaag 
acgatttaag 
agcctggggt 
tactactgcc 
ctgatctcat 
agcaaaagcc 
gcttccttag 
gggttctcaa 
ctgggactct 
gagtgtagtg 
cccacctcag 
ctgttgccca 
ggctcaagca 
cacgcccagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860 

tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 1920 

gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 1980 

ttttctgctc caactctttc accccactta gcctcgtggc tcactctctt acctcttcag 2040 

ctcctcagtc aggcctgagg acccctgttg aaaattgcaa accacacccc ccaccaccac 2100 

cacccactat tgccagcact ttctactcca tttctctgct ttacttttct cctttgtact 2160 

catcaccacc tgactcatta catgtttacg tatctttctt ctctccacta gcatggaagc 2220 

tccaggagag cagagagtgt agttttattc cctgatgtgt ttcctgtgcc cgtaccaggg 2280 

cctagcacac agtaggtgct cagtaaatgt gtgttggatg aacaaataca gtgaaaggat 2340 

ctgatctaca tttataaaga aggcactctg gctgctgagt ggggatgaga ctgtcaggag 2400 

gaaagaggcc cctgtggggg cctggccagc aggtgggtac aatggtagca gccaggagag 24 60 

agggcctctt ggactcaagt ggatggggcc tgctcagggc tccggccaca ggaacaaagg 2520 

gaagggggcc caggatggcc tgtcatagag gacacattac aactggccca aagttcaagt 2580 

caggtttcta aatttgggaa gggatacaga aaaactaaag actctactgg acagtcagtt 2640 

attgaaatga ttacatagaa aatgtaccaa gaattaaaaa aaaaaaaaaa aagcattatg 2700 

aaggggccac cagagactcc cagagaggaa agggactatg ggctggatgc ggtgactcac 2760 

acctataatc ccagcacttt gggaggccga ggagggtgga tcacgaggtc aggagttcaa 2820 

aaccagccta ggcaacatgg taaaaccccc gtttctacta aaaatacaaa aaattagctg 2880 

ggcatggcag catgtgcctg taatcccagc tactcgggag gctgaggcag gagagttgct 294 0 

agaacccagg aggcagaggt tgcagtgagc cgagattgag ccactatgct ccagcttggg 3000 

cgacagagca agactccgtc tctaaaaaaa agaaaaaaaa ggccagatga ggtggctcat 3060 

gcctgtaatc ccagcacttt gggaggccga ggtgggtgga tcacgaggtc aggagatcga 3120 

gaccatcctg gctaacatgg tgaaactcca tctctactta aaatacaaaa aattagccgg 3180 

gcgtggtggc gggcacctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 3240 

gaacctggga ggcggagctt gcagtgagcc gagattgcgc cactgcactc catccagcct 3300 

gggcgacaga gttagactcc gtctcaaaaa aaaaaaaaaa aaaaaaatta gctgattagt 3360 

tgggcttggt ggcgggcgcc tgtaatccca actactcggg aggctgaggc gggagaatca 34 20 

cttgaacccg ggaggcagag gttgcaatga gccgatatca cgccactaca ctccagcctg 34 80 

ggcgacagag caagactcca tctcaaaaaa gaaaaaaaaa aagaaagggg ctgtgctgtg 354 0 

gcctgggacc caaagcacac tactgcaagg tcccagggtg cctgactcca accggagcct 3600 

tgagaacatt catttgcaaa gaatgaatta aaattcagca ctattttatt ctgcaggatt 3660 

ccagcacccc aaggacagtc atttttagac ccttcagtaa cgtaataagt aaccggagga 3720 

tgtgctgagc ttccacttcc ccagacggtt gcctgtcaca gctcatcagg ccaacaaact 3780 

tttcttaggc ctcaaatttg gaaatgttca ctctcagttc gttccttaga tgcaagtcca 3840 

tcccaatgaa gtaacagggg ctcagcacct gtccaatctc attgcttccg gggacagggg 3900 

cccatgagga tgtcgtttca gcccggtgac acttgggcaa agtgcctttt ggtttccctc 3960 

ccaggctgga acgtgctggc tctgtgaagt tacgctgggc acaagagccc cccccaaccc 4020 

ggcaggactg actgctgtgg tcagaggcgc ccctggggct ttgggagcca cagaatcttc 4080 

ctgagggcag cgccggagga ggccccagtg agagtgccca ctgccaggct cattcctcag 4140 

gctgccgcag gcctctcccc aaaacaggca atgcttctca- gcaacctgcc ccaggagcag 4200 

gccagggaag gccgccatcg gcctacagtg ctgggctctg gagggcttgg ttggtaacag 4260 

gccatggttt ctatgagcca gctggggtgt gaaggacaca ggctggattc acctctctgg 4320 

gcctcagttt ctgcattcaa aaagtgggaa tcatgatatc tgctctattt cttatctctc 4380 

agtgctgatg tgaacctcca ataagacttt taaaaatact ctttctacct tacttttatt 4440 

tttcatttat tttaagataa tgtctagctg tctcacccag gctggagtgc agtggtgtga 4500 

ttacggctca ctacagcctt aacctcccag gctcaagtga tcctcctacc acagcctccc 4 560 

aagtagctgg aactacaggc atgcaccacc gcacctggat aattttttct tttgagacaa 4 620 • 

ggtttcactc tgttgcccag gctggagtgc agtggtgcac tcttggctca ctgcagcctc 4 680 

aacctccctg ggcttaggtg atcctcacac ttcagtctcc caagtagctg ggactacagg 4740 

tatgtgccag tacacccagc taatattttt gaaggatggg gtttcactat attgcccagg 4800 

ctggtcttga actccagggt ttaagcaatc taccttcctc agcctgccaa agtgctagga 4860 

ttataggtat gagccacccc ccggcctata atcctaccac tttaaaaaag cctgtaattt 4 920 

tagcacttta aaaaattttt ctaaattttt tatagagatg ggggacagct gtggtctcac 4 980 

tgtgttgccc aggctggtct tgaactccta ggatcaagcc atcctcctgg cctggcctcc 504 0 

caaagtgttg ggattataag cataagcctt accttacctt ttttttttga gttgcagttt 5100 

tgttcttgtt gctcaggctg gagtgcaatg gcaagatctt ggctcactgc aacctccacc 5160 

tcccgggttc aagcaattct cctgcctcag cctcccgagt agctgggatt acaggcatgc 5220 
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gccaccacac ccagctaatt ttgtattttt agtagagatg gggtttctct atatacctta 5280 

attttaaagc actgcattca tgtaaattgt gattaacatg gattcaagag agggagtgag 5340 

gatgaatgag ccaggcagtc acctcggctg tcaccctcca cttctctcct ccttctgaca 5400 

gtcatcgtcc atccgtttct gcagctgttt gtttgactct cctgatcatt ttgcttgcca 54 60 

cataacttgc ctcctgggaa agaatgccct gggcaggccc acatgagtag tgaaaaataa 5520 

tctgcagtga aaaataaaac taagtagtct ggtccacaga gcagtcttat tttttcactg 5580 

cagatgaagg agttgacatt caggcttcat tctcatttat aagtgtttta aagacacata 5640 

cagtggattg aacagtggcc ttcaaaaaga tgtatctaca tcctaatccc tgggacctgt 5700 

gaatgttaac caagttagga aaagggtctt cccgggtgtc attaagttag agatcttgag 57 60 

atgaggagct catcgtggat tatccaggtg gaccctgcat ccaaggacaa atggtcctta 5820 

gaaaagaaaa gcagaggctg ggcacagtgg ctcaagcctg taatcccagc actttgagag 5880 

gccgaggtgg gtggatcacc taaggtcatg agttcgagag cagcctggcc aacatgatga 5940 

aatcccatct ctactaaaaa tacaaaaatt agcaaggcat ggtggcgggt gcctataatc 6000 

ccagctactc aggaagctga ggcaggagaa tggcttgcac ctgggaggcg gaggttgcag 6060 

tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 6120 

aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 6180 

catgtgaaga tgaggcagag gttggagcca tgcagccaca agccaaggaa tacctggagc 6240 

cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 6300 

ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 6360 

tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 6420 

acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 64 80 

tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 6540 

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 6600 

aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 6660 

attagccagg catggtggcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 6720 

gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 6780 

agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg 6840 

agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 6900 

catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 6960 

aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 7020 

gggaggtgtg cacactttta aatgagcaga tctcacgaga actcaccatt gcaaggacag 7080 

caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 7140 

cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 7200 

cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 7260 

cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 7320 

ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 7380 

tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 7440 

ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 7500 

caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 7560 

gcacactggt gcaaggggca ggctcccaag gccttgggca gctctattcc tgcggctttg 7620 

cagaattcag tccccatggc tgctcttaca gattggagat gagggcctgc ggcttttcca 7680 

ggtgcagggt gcaagctgct ggtgatctac cattctgggg tgtggatggt ggcggccccg 7740 

tcccgcagct ccactaggca ttgtcccagt ggggactcta tgtggggcct ccaaccccac 7800 

atttcccctc caatgggaag gctctgcccc tgcagcagcc ttcttcctgg gctcccaggc 7860 

tttctcatac atcctctgac atctaggtgg atggtgtcaa gcttccttca ctcttgcact 7920 

ctgcacacct acaggcttaa caccacatgg aagctgccaa ggtgtatggc tggaaccctc 7980 

tgaagcagca gcctgagctg tgactatggc cctttgagcc aaggctggag ctggaacagt 8040 • 

ctagatgcag gcagggagca gtgtcctgag gctgtgcaga gcagcagggc cctgtgcctg 8100 

gacaatgaaa ccattctttc ctcctcatcc tctgggcctg tgatgggagg gttgtggaag 8160 

atctctgaaa tgcctttgag gcctttttgc ctctgaggcc tatttcctat tgtctcagtt 8220 

attggcagtc ggctcctttt tagttatgca aatcctctag caagaggtta. ctccactgcc 8280 

ggcttgaact cctctcctga aaaagctttt tctttctttg tcacatggcc aggctgcaaa 8340 

ttttccaaac ttttatgctc tgttttacct ttaaatataa cttctaactt taattcattt 8400 

atttgctcct gcatttgagc atagggaatt caaagaagct gggccacatc ttgaatgctt 84 60 

tgctgcttca aaatttatgg ccacgcttgg tggctcacac ctgtaatccc agcactttgg 8520 

gaggcctagg tgggcagatc acgagatcag gagatcgaga ccatcctggt caacatggtg 8580 

aaacccatct ctactaaaaa tacaaaaaaa ttagcttggt gtggtggcgc agacctgtag 8640 
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tcccagctac tggagaggct gaggcaggag aattacttga acctgggagg cagaggttgc 8700 

agtgagccca gatcatgcca ctgcactcca gcctggtgac agaataagat ttgatctcga 8760 

aaggaaggaa ggaaggagga agggaagaaa tgtcttcccc ccagatgtcc tgggtcatcc 8820 

ctcttatgtt caaacttcaa cagatcccta gggcatgaaa ataatacagc caaattattt 8880 

gctaaggcat aacgaaagtg acctttgctc cagttcccaa taagttcctc atttccatct 8940 

gagactcatc accctggcct tggcttgtcc atatcactgt cagcattttg gtcacaatca 9000 

tttaaccagc taatcgggag gctgaggcaa gaggatcact tgaacccagg aggttgaggc 9060 

tgcagtgagc tgtgatcaca tcactgcagt ccagcttggg caacagagca agatcctgtc 9120 

tcaataaata aataaataaa tacataaata acttaagttt atttaaagct gcatctttgc 9180 

caccatggag aaaggccagg ccagctcctt ctctctttct gcacgtgttc ctcccacctc 9240 

agctgcctct gctcctcaag gaggaacaga gggagtagga aaggccatcc caggaggccc 9300 

agcaccccat gacctggctc tggggccttg tgggtttatg gattcccagt gctgagtcat 9360 

ccctcacagg ctcttgtggg caccttggac attggtcaga agcatgtggt ccccgggaac 9420 

acaccttttc ctgatcatct gggaagggca gcttgtgcca gcgaggccac ctgttcagcg 9480 

ccacggcccg ccagacagct gcagccacag ccttgccttt gatcagagca aacaccagac 9540 

atgtgtgtca tgcccccaac ccatctccag gggacacatg tcctttcttg ccaggcctga 9600 

gatgaacaag agagggacaa gtccccaagc ctctctctcc ttcctgcctc acccactccg 9660 

ctgttagatt ctcaaggtgg atggtgggct aactagggca accgaccatc ctggtttacc 9720 

tagaactgag ggggcatttt caggaataaa actgcaaaag tctggagcaa acaggagcaa 9780 

gttggtcact ctggggctgg tggagtcagg tttccttctg caggccccct ccccgcaagc 9840 

atgggtggaa cccaggacag gaacacagag caggccccag gaccgggctt gtcacttaca 9900 

agtctttttt tttttttttt ttttgagatg gagtcttgct ctgtcatcag ggctggagta 9960 

cagtggtgcc atcttagctc actgcaacct ctgccttctg ggttcaagtg atccccctgc 10020 

ctcagcctcc tgagtagctg ggactacagg tggcaccacc acgcccagct aattttttgt 10080 

atttctagta gagatgagat ggccaggctg gtcttgaact cctgacctca agtgatctgc 10140 

ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgtgcc tggccccact 10200 

cacaagtctt aaaccatgcc tcagcacatc aatgccattt acaaaaaggt agagggattt 10260 

tccaggcaaa aatagatgaa agacatagga tgattgatca tgtcctgctt aaacataggt 10320 

ctgatgctat taagaattga gggctgggag cggtggctca cgcctgtaat cccagcactt 10380 

tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 10440 

gtgaaacccc atctctacta aaaatacaaa aaatggccgc gcgcggtgac tcacgcctgt 10500 

aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 10560 

cctggctaac acagtgaagc cccgtctcta ctaaaaaata caaaaaaaat tagccaggca 10620 

tggtggcggg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 10680 

gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 10740 

gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 10800 

ttgcaggtgt gacaaatgat tctgtttctg tggcagaatg ttctcaggag atctcttttg 108 60 

aactctcatg gaaagcatca tgctgttggc aacatcacat ttatttttat ttatttatta 10920 

ttttttagag acagggtctt gctctgttgc ccaggctgga gtgcagtggc acaatcacag 10980 

ctcactgcag cctcaacctc ctgggctcaa gcaatcctcc tgcctcagcc tcccaaagta 11040 

gctgggacca caggcgtgag ccactgcact cagcccaatg taccttcaat atttacattt 11100 

ctggcaaagg tagcaaaacc ttaacaaatt ttgaatctag ataataaaat tatgaggctg 11160 

ggtgcagtgg ccctgacagg gatggctcac atctgtaatc tcaacatttt gggaggccaa 11220 

ggtaggcgga tcacctgagg ccaggagttt gagaccagcc tggccaacat ggtgtaaccc 11280 

tgtctctaac aaaaatacaa aaaaattagc cagacgtggt ggtgcacgtc tgtcatccca 11340 

gctactaggg aggctgaggc aggagaattg cttgaacccg agaggcagag gttgtgatga 11400 

gccgagatcg cgtcattgca ctccagcctg ggcaaaagca agagcgaaac tctctctcca 114 60- 

aaaaataaaa aaaaaataaa ttaatgaatt aattaaaata aaataaaata atggatagtc 11520 

actgtaaaga aaaaataaat gtatatatca gccaacaagt gatggaatag agcaccccat 11580 

ctccctggct ggacagatac atcccacaac acctggaagg cggctccatg tagaactttc 11640 

tggactgctt gaggtgctgt gctggagcac ggtgacagag gagctggacc atggacctcc 11700 

cccggccccc accaagggcg aggtccccct gtggtgggtc tgagggaggc atccgtatgg 11760 

cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 11820 

ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggccccctga 11880 

agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 11940 

aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 12000 

cagcaagggc cagaacaggg atggccctgc ccagggtcat gcancgagga agtggccagg 12060 
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ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 12120 

tagagccctc cagctcactc atcctcggcc agctccatct cctcagcttg taaacccccc 12180 

cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 12240 

tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 12300 

ttgagaccat tctggccagc atggtgaaac cctgtattta ctaaaaaaaa aaaaattaac 12360 

tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 12420 

aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 12480 

tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 1254 0 

gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 12 600 

ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 12660 

caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 12720 

cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 12780 

ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 12840 

cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 12900 

tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 12960 

ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 13020 

tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 13080 

cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 13140 

cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 13200 

ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 13260 

gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 13320 

agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 13380 

ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 13440 

ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 13500 

cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 13560 

tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 13620 

ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 13680 

tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 13740 

cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 13800 

acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 13860 

ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 13920 

tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 13980 

acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcctcggg agcagtggac 14040 

agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 14100 

gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctct 14160 

cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 14220 

ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaatag cccttcaagg 14280 

tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 14340 

aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 14400' 

ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 14 4 60 

atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 14 520 

gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg ggaggagggc 14 580 

ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 14 640 

acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 14700 

tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 147 60 

gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 14 820 

gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 14880- 

cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 14 940 

tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 15000 

gatcttcctg cctcagcctc ccaaagtgct gggattactg gcatgagcca ccgcacccag 15060 

cagagacctc agttttctaa cctgtgccag caggaataat gatagctgcc tagcttggct 15120 

gtgctgggaa ttaagtaaga tgaccgggta gcaaatatga agtattactg gacacagagg 15180 

gccccaggct gggttagcag cggtggtcag ggctgctgct tcctggcctg agctcgaagg 15240 

agggccctca ttaccacctg ggtgagtcct cgtccaagcc tggcactgct gcgtgggaat 15300 

aacttctgcc acccaagttg gcagattgtg tgcaaagtta agtcctgact ctgtggggtg 15360 

gacttcgagg cctcttcatc ggacctgctt ccggtgactg cattcgcacc tcctcctgtt 154 20 

cctggtttaa cacagcccag ctttcctcct gctgagccct ccctgggcct gctgtcaccc 15480 
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tcgtgccgct gtgcctcgca gtgccactcc ctgtaccctg aatactttgc cctgcctctc 15540 

cacccagctg agagtcaggg cccctgtgag gctctgccca gcccgtcctc cgggtttctg 15600 

cctctgctga gcacttccct gcatgattgc ttctgagagt ccccccagcc tgtgagcttc 15660 

tcaggactgg gacagcttct caggaccgag gcttcctggt ctgcttgcaa ttttacaggc 15720 

gggcacattt tcccttggcc aacatcagag actggacatc tgcagatctg tgctagccac 15780 

tgagcaccca ggcaccccag caggtagctc tgtaaccaac ccattctgta aagctgaggc 15840 

tcagagaggt gaagcgcctg gcctggggcc acagcctgcg tcagctgcag agccaggagc 15900 

tgagatatgc acctgcggct ctgctcacag ggtcctgcac agactgctgc tggagccacc 15960 

tatgtagagt caagagagtt catgttaact ccctctcaca tccctcagcc agggtggggg 16020 

ctgacgatag acactcaggg atggcctacc ctccccaaca acccccgtca ggtttgccgg 16080 

atctccttgg aagaaaagtt ctgggcagaa ttccaccgtt ggcctggcct acactctcct 16140 

tagtggctta ggaccctcag cggtggataa gttgtgggca gaagagatgc aatcaggatt 16200 

ctcacccact caccccttgc cagccccaat aagctcaata agctgggctc ggtctgagga 16260 

agtgtccagg aaatgtgcaa atggcctggg acagccctgt gttcctttca gtaaggttgc 16320 

tgaaggtgag gctgaaagtt ggagaaacag aagccagtgq ttatggtttt aattaagata 16380 

atggaatgta tgtatgtatg tatgtatgta tgtatgtatt tatgtattta tctttagaga 164 40 

tagagtctca ctctgttgcc caggctggaa tgcggtgaca caatcatagc tccttgcagc 16500 

ctcgacttcc tatgcccaaa tgatcctcct acctcagcct cctgagtagc tgggactaca 16560 

gacacacgcc aactatgcct agctaatttt tatttctatt ttttgtggag actgggttct 16620 

cactttgttg cccaggctgg tcttgaaccc ctagcttcaa gcaatcctcc tgcctcagcc 16680 

tcccaaagtg gagggattac aggtgtgagc caccacacct ggcctggaat ttatttgtat 16740 

tctgcttata aaattaatac attcttattg cagaaaagtt tgaaaataaa agaaaggaca 16800 

aagaacaaaa agcgtatata atttcacagc tcagatctca ctgctattaa catttttatt 16860 

tactttcagg cttttttctt tctaggtaca tatgcagaga ttattttatt ttatttattt 16920 

tattttatat tttattttat attttttatt tcattatttt attttatttt attttattat 16980 

ttttagagac agggcctcac tctgtcaccc aggctggagt acaatggagt gatcatagct 17040 

cactgcagcc tcaaacacct gggctcaagc aatcccccca ctcagccttc tgagtagttg 17100 

ggactaaagt gtgagtctgg ctaatttttt ttactttttg tattgacaga ggtctcacta 17160 

tgttgcccag gctgatctca aactcctggg ttcaagcgat cctcccacct tggactccca 17220 

aagtgctggg attacaggca tgagccacca tgcctggcct aaaatgccac tttttgtcat 17280 

ttactaaaat cccatggaca ctttgacatg tctgtattct atgctattga tctgactgtt 17340 

ggcatctaca tcattatggc catctatcat ctatcataat ccattttaac attaaaattg 17400 

tgctgctgct tagatttttc tggcctgtct cctatttgta ttcttccaga taaattttag 174 60 

aatcatttta tcaaattccc cttgcagaaa aagccctatt ggattttggt tgaaaaatac 17520 

tgaattttta cattaactta ggaaagggct gggcacggtg gctcacgcct gtaatcccta 17580 

cacttttcga ggccaaggca ggtggatcac ttgaggttgg gagtttgaga ccagcctggc 17640 

caacatggtg aaactcggtc tttactaaaa atacaaaaat tgccaggcgc attggctcac 17700 

ctgtaatccc agcactttgg gaggccgagg tgggtggatc acgaggtcag gagatagaga 17760 

ccatcctggc taacacggtg caaccccgtc tctcctaaaa atacaaaaaa ttagccaggc 17820 

gtggtggtgg gcgcctgtgg tctcagctac ttaggaggct gaggcaggag aatggtgtga 17880 

acccaggagg cggagcttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggcga 17940 

cagagtgaga ctccatctca aaaaaaaata ataataataa tacaaaaatt agccgggggt 18000 

cgtggcgtgc acctataatc ccagttactt gggaggctga ggcaggagaa tcgcttgaat 18060 

ccaggaggtg gaggttgcaa tgagcagaga tcgtgccact gtactccagc ctgggtgaca 18120 

gagtgacact ctgtgaaaaa aaaaaaaaaa ttctgaagga ttgagactct tagactctta 18180 

ggtcttccta tccaagagca caatatagct tttcatgtat tcaagccttt ttcaatgcat 18240 

caacagaatt ttacagtttt tttcatgata tcctgctatt tcttataaaa tgtattccta 18300- 

gatattctgc atgttttccg gttgtttgtt aataaatatt tttcatttgt cattatttcc 18360 

taattggctg ttatttgtat atatgacatc tgttgaattt tttgattact ttgaaaatgg 18420 

ccattctttt gtgttttttt ttaactttct attttgagat aattttgact tacagaagat 18480 

ttgcaaaaat agtacagaga gttcctgttt cccccttatg ttaacccagt ttctccttat 18540 

gttaacatct tacataacta cagaacaatt gtcaaatcta agaatcaacc tgggcacaat 18600 

gctattaact aaactgcaga agctgttcag atctcaccag ttcttctact gctccccttt 18660 

tctcttccag tgttcaatcc ggaatcctac attatattta gttgtcattt ctctttggtg 18720 

tcttccaatc tgtgacagtt cctcagtctt tctttgtctt tcatgacttt cattttttta 18780 

tacttttgaa aaatactggc cggttgtttt gtagaacgcc ctcagtttgg gtttgcctga 18840 

agttttttgt gattagatcg aggtcatgca ttattggaga gggtgccacc gcctcgatgt 18900 
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gcaagctcaa tgcatcatat cagagggttt gtaatgtcag tttataccgc cggagaccct 18960 

aacctggagc atttcgtgaa ggtgctgtct gccaggattc tccactagaa agttactatt 19020 

tttccctttt taattactga atgtctgagg ggaaatactt tgagactatg caaatatcct 19080 

gtttctgctt taacttcggc tcactaagtt tagcattcat ctatggatct cgcttatagc 19140 

aagtattact gtggagttct aatggtaatt ttctgtttct ctcattcctt caacctttat 19200 

taatatgctt cttcctcact tattcatttt gtttcagttg tttataccaa catggatttg 19260 

tggatattgg ttttattctt tgggttgcaa ttgaatccta tcattatttt gttagtcagt 19320 

tgttccatcc gaccttggtc attaggagcc cttgaaattt ggctcccatg cctttttttt 19380 

tttttttgag accgagtctc actctgtcac ccaggtttga gtgcagtggc atgatcttgg 19440 

cttcctgcaa cctccgcctc ccaggttcaa gcaattctcc tgcctcagcc tcctgagtag 19500 

ctggtattat aggcgctcca ccaccttgcc cggctaattt tttgtatttt tagtagagat 19560 

ggggttttat tatgttggcc aggctggtct caaactcctg acctcaggtg atctgcccgc 19620 

ctcggcctcc caaagtgctg ggactacagg cgtgagccac cacacctggc ctcctatgcc 19680 

attttaacat gcccgtcttt tctttttctt tcctactttc tgtgactgta agaagctcca 19740 

ggatacattt ttgctgccct agacttagcc tcaatcagtt ctcagaaaag ctctggttct 19800 

ttttatggga tacttagaaa actagctctg tatggcctgg cgcggtggct cacgcctgta 19860 

atcccagtac tttgggaggc cgaggtgggc agatcacaga tcacgaagtc aggagatcaa 19920 

gaccatcctg gctaacatgg tgaaactctg tctctactaa acatacaaaa aattagtcca 19980 

ggcgcggtgg cgggcgcctg tagtcccagc tactcaggag gctgaggcag gagaacggca 20040 

tgaacccggg aggcggagct tgcagtgagc cgagatcggc agccactgca ctccagcctg 20100 

ggccacagag cgagactccg tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 20160 

ctctgtatgc tagttttttt tttaagacag ggtctctctt' gccccagctg gagtgtagca 20220 

gcacgatcac agctcactgt agcctcaacc ttctgggctc aagcaatcct cctgcctcag 20280 

tctcctaagt agctgggtct acaggcatgc accaccgtac gtggcaattt ttaaaaactg 20340 

tttgtagaga tggagtctcc ctatgttgcc tggtctggaa ctcctggcct caagtgatcc 20400 

tcctgcctcg gcctcccaaa gtgctgagat tacaggcatg agccactgta cctggcctgg 204 60 

ccaaggtctg tcttttttta aaagaagttg ttgtatagtt gttttttttt ttattttttt 20520 

ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 20580 

actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 2064 0 

ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 20700 

tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 20760 

ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 20820 

atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 20880 

cctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 20940 

taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 21000 

tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 21060 

taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 21120 

cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 21180 

ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 21240 

accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 21300 

gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 21360 

tacaggcgtg agccaccgcg cccggcgtcc atactttctt tagatatcct tcctttttac 21420 

ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 21480 

cgtcttcaca ggctcccctt gacgggaaga gttcctcaga ctttccttgt ttttgttgac 21540 

cttgacagtt ttgaggagga ctggtatctt agtctgtttt gtgctgctat cacagactag 21600 

ctgagaccga tacatgatac atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 21660 

gttcaagacg aagttgctgg ttggtttggt ctctggtttc aagatggcgc cttgctgctg 21720- 

catcctctgg agaagaagaa tgcggtgtcc tctcactgca gaagatggaa gcgctaaaag 21780 

gaatgaactc cctttgccaa gccattttat aatgggcatt aatccacaaa ggatgaaacc 2184 0 

ctgagaaaca tcaagcttta aagcactggt tctcaacctt tttggtctca ggagcccttt 21900 

atactcttaa aacgttttga ggatcccaaa aaaaggcttc tacaggttcc atcttttaat 21960 

atttaccata tcaaaaatta aactgaaaaa attttaaatt atttattcat ttaaaataac 22020 

aaggataaac ccattacatg ctaacataaa tcatgtattt tatgaaaaat agctatattt 22080 

atcaaaacaa aaattagtga gaagagtggc atgtataatt ttttttgttt attttttgtt 22140 

tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 22200 

tgcaagctct gcctcccagg ttcacaccat tctcctgcct cagcctcctg agtagctggg 22260 

actgcaggtg cctgccacca cgcccggcta attttttgta tttttagtag agatggagtt 22320 
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tcaccgtgtt agccaggatg gtcttgatct cctgaccttg tgatccaccc gcctcagcct 22380 

cccaaagtgc tgggattaca ggcttgagcc actgcgtctg gcctaaattt ttgtgaatgt 224 40 

ctttaatgcc tgccttctca tatttgtttc tgcattcaag ttattgcaaa atgttgtgtt 22500 

ggttgaagtt tgtaaagaaa atgtggcctc atacagttgt gtagttggaa aggcaagagt 22560 

attttgattc tctcttcaaa caactatgga caacctgctg ttacaaaacc agaatgcaaa 22620 

aagttgtagt aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 22680 

ctgagttttg taacattaaa gtccagttgc gttaagctac tgtgatagca tatagcattg 22740 

tcctaatact ggaattagta tcagaagtgg ggtgctactg ttaataaata aaaagaaata 22800 

aataaatcat gtgatactgg ctcagaagtc aggcagtagg ctgtgtggaa cctgacatca 228 60 

cgccatgtaa tacattggca accatttgat ccagctgtct gtcatgatga cttggaaagt 22920 

caaccacata cttacagagc ctgtagacat aggggaaaat agtataaaac agaatactaa 22980 

cagtggacct tggttcttgc cagttgcatt tagccaaata ttaaacaaaa gagatattct 23040 

tgggcagcaa ctggaccatc ttcaagtaaa agtgaaaggt aataaacaga gtccagacat 23100 

ttgtgcccat gcgggttaag aaaaatccag ttgcttctag acaccgtata tgaaaacaac 23160 

gctgaaaaca agcctttgag tggtaaaggc cgattaacac tcagcgcggt aacaaagacc 23220 

aggtgggcta acccgaaatg aaatgagaag cctgtggtga tgaggaggca gagaagtaaa 23280 

atcaagtttg agcatttcgt ttaggagagt ttgggctctg attacttgca catgcaaacg 23340 

aactggaaac aaacagatca gatgtctacc acttcttcga gggaattgca ttgccaaaga 23400 

agtcatgaaa gcagactcta tactgattag gcattaaaac aaaaacaatc tttaggcccc 234 60 

taaacttgca tgggcaggaa gtgggctgtc aaagctgttc atcctctaag gtggacctag 23520 

ttcctagtcc ccagtataca cttcagatgt ggccctggag gacactggac atggaggacc 23580 

tcccagagga tgaggctagg gcttcatttc tccaatgacc tcagctgcct ctatttcccc 23640 

ttcttcctct ggaagtccta tcatcgttat tattattatt atcatcattt ttattttgag 23700 

ataaggtctc gctctgttgc ccaggctgga gtgcagtgac atgatcatgg ctcactgcag 23760 

ccctcccagg ctcaagtgat cctcctgcct cagcctcctg agtagctggg agtacaggca 23820 

catgccacca tgcttggcta tttttttttt cagtagagat aigggctctca ctatgttgcc 23880 

agggctgatc tcaacctcct gggttcaaga gatcctccta cctcagctcc tgagtagctg 23940 

ggattcgggt gcacaccacc atgccaacta atttttaatt tttttttgta tggacaggat 24000 

gtacagtgtt agaaatggat tgcttgcaga ggcaggagga tcacttgagc ccaggagttt 24060 

gatcacactg tgaaccatga tcgcacccct gcactccaat ctgggcaaca gagtgagacc 24120 

ttgtctcaaa aaaaaaaaaa aagagagaga gagagagact caaagatagg caaaaaagtg 24180 

ggaaagcttt atagtggaca aaaaggaacg ctctaagtct gccctattgg catggtgctg 24240 

aaggtgggct aactagagat agggggtact atgtggttga ctatgggtgc atctttggct 24300 

ttccctgggt gatcctaagt tggaagcagg gacaaaaatt agggaagctg ttagttattc 24360 

atcacgttct ggcagtagtg gactggttgt gatagaagtt attgttttgg ccaggtgcgg 24420 

tggctcatgc ctgtaatcct agccctttca gagttcaacg tgggtggatc aggaaggagg 24480 

gaggatttgg gaggtcagga gttagcctgg ctaacctggc gaaatcccat ctctactaaa 24540 

aatacaaaaa ttagctgggc gtggtggtgc atgcctataa tcccagctac tcgggacgct 24 600 

gaggcaggag aatcagttga acctggggag gcggaggttg cagtgagcca agatcgtgcc 24 660 

caatttcatc tcaaaaaaaa aaaaaaagtt atcgtttagc ttcctcgatt gttactggac 24720 

gtagtaatct ggcttcctgc aagtctaact ttcagcagac tggctacatg ggctgtgtac 24780 

tgtagataag gcagtaagta aagcaaaaat tgatagagca tcaaggataa atagaaaatc 2484 0 

cgtaatcaag cagaagattt gaacacttca ctttcagtaa ctgataaaac aagtagacaa 24 900 

aaaaaatcag taaggatgta gaagatttga acaacgtaat taacaaactt gacttgattt 24 960 

acacgtctag aaccctgcag aacacacact ttttcaagca tactcagaac atttatataa 25020 

agtgaccata tggtggacca taaagcagtt tcaacaaatc tcacaggagt aaaataacag 25080 

accgtgtttt ctgaccgtaa gtacagttaa cctagaaatt gaaaacaaaa agctagaaaa 25140- 

accccatgta tctggaaatt ttaatataca ctttgaaata acaaatggat cagagattaa 25200 

ttcaaatagg aatttagaaa taccttgaac tgaaaaataa tgagaatact ataccccaaa 25260 

actgtggggt gcagctgaac agtatataga cgaaaagtat actcatatgt gcatacctta 25320 

aggagcgggg aggattgaaa gttaatggga ggcaaaagca ggtggatcac ttgaggttag 25380 

gagttcaaga tcagcctggc taacagggtg aaaccccatc tctactaaaa atacaaaaaa 25440 

ttatccaggc gtagtgaggc tgaggcaaga gaatcgttgg aacccaggag gcagaggttg 25500 

cagtgagccg cgattgcgcc actgcacccc agcctgggag acagagcgag actccatctc 25560 

aagaaagaaa aaaaaaaaag aaaaggccag gcgcggtggc tcatgcctgt aatcccagca 25620 

ttttgggagg ccgaggtggg cggatcacga ggtcaggaga tcgagactat cctggctagc 25680 

acggtgaaac cccgcctcta ctaaaaatac aaaaaaatta gccaggcgtg gtggcgggtg 25740 
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cctgtagtcc cagctactca ggaggctgag gcaggagaat gtcatgaacc caggaggcag 25800 

agcttgcagt gagccgagat cgcgccactg tactccagcc tgggcaacag agagagactc 25860 

tgtctcaaaa aaaaaaaaaa gttaatggga taaacatcca tctcaagaag ttagaaagga 25920 

atgacaaata aaccaaaaaa aaaaaaatca aaagaagaaa atcataaggt caagactata 25980 

aagagagtgg ctgggtgcag tggctcaggc ctgtaatctc agcattttgg gaagcagagg 26040 

tgggcagatc acttgagccc aggagttcaa gaccagcctg agtaacatag agagacctca 26100 

tctttgctga aaataaaaat aaaaaattag ccaggcatgg tggtactgag gtgggaggat 26160 

cacttgagcc taggaggttg aggctgcagt aagccatgat tgtgccactg cacttcagcc 26220 

tgggtgacag agtgggaccc tgtctctaaa aaactaaaat aaggctgggc gcggtggctc 26280 

aaatctgtaa tcccaccact ttgggaggcc aaggctgagg tcagcagttt gagaacagct 26340 

tggccaacaa gatgaaacct catctctact aaaaatacaa aaaattagtt gggtgtggtg 2 6400 

gcatgtgcct gtaatcccag ctacttagga ggnnnnctnt ngattatatt ttctccttcc 264 60 

tacgtcgtta ttggactgaa ttcagaatga tgactctcat tggagctctt cctgtctcct 26520 

aactacagtg gcttccgacc ccactctggt tttcacttca cccctctgct gctcatacga 26580 

gtagatactt ccttccttct ttctcacttg ttgctcttcc tcaacccccc ccgttggtgt 26640 

cccctcctct ttatcttttt ctcgcgacac ctgcgttctc ttgccctctt atcatccctt 26700 

tctcgaggcg gtcctttcct ttatccagct taaatacctt ctcctctgtt tatttggggg 26760 

ttgggttttt atctctcacc ctccctctaa tttctttcct ctttccgcac ccatcaagcc 26820 

tctcgtggtt tctcttcctc tactctcggg tcccccccct ctccccttct ttttttcttc 26880 

acccccccaa gcgctttgcc ttttttttct ttgcccttta ttcccccc 26928 



<210> 7 

<211> 29430 

<212> DNA 

<213> Homo sapiens 



<220> 

<22fl> unsure 

<222> (4336), (4345), (4349), (4392), (4447), (44 90) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 7 

aggggaaggg ccggctccgt agctcacacc tataatccca gcactttccg aggagagagg 60 

atcatctcag gccaggagtt caagaccagc ctgggcaaca cagcaagacc gcatctctac 120 

aaaaacttct tttaaagctt aaaaaaaaaa aaaaaagcaa agaggacagt tcaggagaaa 180 

agcctgtaga ggcagcacac taaggaggag acgcagccca ggcaccagga ggggctggcc 240 

atgggcactc actcctccag caggcgagtg cccagcacca gctggcccac ccagacaccc 300 

aggacacggc ctgaatggct ccgtattcac gtgggtggta ataaacaagc aatacacata 360 

gccaataagg acaccttagt aatgttacat cataaacgct gcagatcagg gaaatggtgc 420 

agggtgaagt gggttggggg gctgcatgct acatgagaag tgggtcgggg ggctgcatgc 4 80 

tacctgagac agagcaggcc ttgctgggaa agaaggagcc ggcaggcctg ggcaaaggtc 540 

ctggggtggg agcacactgg agcagagtgt gggggtagca tggcgggtgc tggtcctctg 600 

ggcgccttcc caccacgtca tgtgcccatg tgcccaaggt ctctcgtttc acagccccct 660 

gaagctcagg ggtcacagct acacagcccc cagatacctt ggcctgcccc aggtcattcc 720 

atccagtgat ggacctgctg acctctagcc tgacctctgg gcagcgtaat ttgagaagga 780 

ggagaaggga gggcaacaga cctggggcga tgagggatgc acagggtggc agacacctga 840 

ggctgcacct tggagcctca gttctgggtg tgggtggggg atggacaggc tgagggctga 900 

agcagctggg cccggccacc atcacacccc aggacccacc agatcaccat gaaaaaccga 960 

atgtcaactg gcagcccaga gtgcagaaca aacctttcag aaacacggtg gtgactgccg 1020 

catcatgaac ataaaataat tacgccctct ccccagggat cacccctgca ggagtttgtc 1080 

ccaagaaaca ccagaaagaa ggaaaacgtc tgagtcacaa tatttgctga ggccttattt 1140 

gtaatagcaa aaaaaaaaaa aaaaaaagaa caatctccag cggcaggggt aactagacta 1200 

ttgtctccgt ggaaaggtag caccaattaa ctagtaacaa aatgactgcg gtaacaacaa 1260 

aacgttcgac atgtcaacac caaaaaccac acacccagca taaccgtgaa ccatgatttc 1320 



Page 29 



WO 02/092015 



PCTYUS02/15982 



032796-132. ST25 



tactagaatg aatggcagtt atgagaaagc accagcggag acaaagattg aaaaagtaaa 1380 

ggtggcctca ttagggagac aagtctctgg gtaatatatt gtaatactgg taaatatata 14 40 

gtttttaata tattttttaa ttccaaattc catatatgtt cctatgaagc tatttctgca 1500 

aatatttttt tcaggaccgt acatcacaaa ggcaaaaggg ccaggtcagc tctccagctg 1560 

agagtgacca cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 1620 

gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680 

tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740 

caagaggtaa cagccgggag ggggtagtgg ttttcattaa ttaaatgttg ccttcatcca 1800 

gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 18 60 

caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 1920 

ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980 

tattttgaga cggagtttca ctctagttgc ccaggctgga gtatgatggc gcaatctcag 2040 

ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 2100 

ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160 

gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220 

ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 2280 

taaagacacc tctggctgct gtggaaaaca gcctggtagt gcctcaaaaa gttacacata 2340 

gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400 

ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 24 60 

aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520 

caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 2580 

tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2640 

atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700 

aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 27 60 

ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 2820 

cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2880 

agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2940 

ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000 

tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaaggcccga 3060 

cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 3120 

aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3180 

cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 3240 

caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacatt gcaccattat 3300 

actccagcct gggtgacaga gtgagattct gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 3360 

ctaaacaaaa gcaaaaaaac caatgagtaa tgttgtcaag tgaacttcat cccaatggga 3420 

atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480 

cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 354 0 

cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 3600 

cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 3660 

ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 3720 

gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 3780 

tttgaaaacc caagtgccaa caatctcatt ttaacccatg taagcatatc caatatttat 3840 

atatagaatt cataacagat gtctgggctt ccattccaat agcctatatt ttacactgtt 3900 

tatttacatg gttacaccaa acaagactca attcaaggta acccaatcct ttgctactat 3960 

accaaaataa gcaacatttt cagtccatgc cttatatata ttcaccaagc attacactag 4 020 

gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagtc 4080 

aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 4140 * 

gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 4 200 

acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 4260 

cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 4 320 

gaagacctgg ctgtangtgg ggtangcang ctttctaaat ggggaaaatc tggctgtggg 4 380 

tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 4 440 

ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 4 500 

tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 4 560 

ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 4 620 

caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 4 680 

aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 4740 
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gagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 4800 

acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 4860 

agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 4920 

gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg 4980 

gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 5040 

aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 5100 

gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 5160 

aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 5220 

aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 5280 

ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 5340 

ggtggcagtg gttaccagca acatgaatgt tcttaatgcc actgaacttc acacttacaa 5400 

atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 54 60 

ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 5520 

acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 5580 

aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 5640 

tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattgtacc 5700 

attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 5760 

aaaataggta aatgcaaaca tatggtatag taatattatg ggctattatg agctacaaaa 5820 

aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 5880 

ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 5940 

gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 6000 

tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 6060 

agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 6120 

cccagacctg ggctctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 6180 

gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 6240 

tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 6300 

tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 6360 

gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 6420 

ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 6480 

ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 6540 

ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 6600 

cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 6660 

ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 6720 

cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 6780 

gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840 

agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900 

caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 6960 

cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7020 

tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 7080 

catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 7140 

aaaaaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200 

cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 7260 

gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 7320 

tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380 

tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 7440 

tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 7500 

atttctgtat ttttagtaga gacggagttt cgccacattg gccacgctgg tctcaaactc 7560 • 

ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7620 

tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7 680 

tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7740 

tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800 

tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860 

agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 7920 

gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7 980 

tatttatttg tttgagatgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8040 

tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8100 

aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160 
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agatggggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccac 8220 

cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280 

aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 8340 

actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400 

ctgtcttcct atccgtgaac atctactgca gtcctctcca atgtccttct gtaaagttgt 8460 

attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520 

taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8580 

ttctcacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 8640 

aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700 

caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 8760 

caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8820 

tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 8880 

atattcattg ctgggccatt aacaacattc ttggcaaaac cataccttag cttctcgtgg 8940 

aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 9000 

atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060 

attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120 

ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180 

tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 

aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300 

tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 9360 

tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420 

tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 9480 

ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540 

gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 

taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9660 

ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720 

accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780 

agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 9840 

ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 

gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 

catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020 

ggacagtttg cctcttccca aaccaacttg gattataaaa 'agcttttctt aacgaccaca 10080 

agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140 

ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 10200 

catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 10260 

gcccagtcac ccagacagca ccccactccc cacacacacc tcacatctgc ccgcctcaca 10320 

ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 10380 

cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 104 40 

ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500 

gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560 

cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 10620 

caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 10680 

acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740 

accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 10800 

tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 10860 

ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 10920 

acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 10980* 

ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 11040 

ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 11100 

ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttctctcc 11160 

cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttccac tcaggaaaca 11220 

tattttgaaa aacactggcc tagaggcaga agtgaaatgg aaaacacaaa agtaaaactg 11280 

aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 11340 

ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 11400 

gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 114 60 

caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 11520 

tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 11580 
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caggtctctg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 1164 0 

gaccccaggc aagtecttgc ctcccagacc cagaacccca tggagcccac cagggctggt 11700 

tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 11760 

ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 11820 

tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct tcctgagaga 11880 

aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 11940 

ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 12000 

cagaagctca ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 12060 

tatctctctc acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 12120 

gttagcaact gtggaatgtg accagctcag tgatcccagg acaagctgct agggaatatg 12180 

acatttgatt gatgtctgca aatgtgcgtt ttcactaatt agaaggttta gggcagagca 12240 

gagaaaaata tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 12300 

attcttattt tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 12360 

tgccttgcca agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 12420 

cagggtggag actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 12480 

agcagtcatc accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 12540 

ctcaggttca tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 12600 

ctgttccctc caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 12660 

acaaggaggg cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 12720 

atggaagaaa acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 12780 

tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 12840 

ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 12900 

aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 12960 

atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 13020 

tacatttatt taaaaacaag gattttcatt aaacacgtac cccacagaca gcaaccccat 13080 
ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg . 13140 

tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 13200 

tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 13260 

gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320 

cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380 

gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 13440 

gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13500 

ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560 

tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13620 

cggctgggct ggcacagact cgtggtttcc atcgaggtgg gaggaggtgg gacgtcccag 13680 

cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740 

ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 13800 

ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13860 

cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13920 

catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980 

ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14040 

aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 14100 

ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160 

aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220 

agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 14280 

tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14340 

tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 14400- 

gcaggtgcac aggacgaaac acattccgtc cgggggaaac ctgcaggaaa gctccctctt 14460 

cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520 

tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580 

aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14640 

tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700 

agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760 

tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 14820 

gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880 

atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940 

ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000 
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actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15060 

ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120 

acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180 

tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 15240 

ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300 

ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360 

cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 15420 

gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480 

ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 15540 

atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600 

tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660 

ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 15720 

ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 15780 

agcagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840 

cgggaggagc cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 15900 

acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960 

cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020 

gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 16080 

ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 16140 

gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 16200 

ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtccag agtccccggg 16260 

gggtcgtcac tatggagtaa caattgcgat gcgatggtaa ccctaacagc taaccgtcac 16320 

tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 16380 

ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 16440 

aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500 

cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 16560 

ccccgaccct gcacctgcag ctcgctactg ccccctcagt gactccagca aacctcgggg 16620 

taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 16680 

actgctgcat aatggt.tgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 16740 

cttccagcat gccttggtga ccaagctggc cttaggtgct gtgggcagcc aagaatagaa 16800 

cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 16860 

gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 16920 

cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 16980 

ctgaacctgt ttctcacctg aaacacatca taccattgga ccccagccct ccgggagagg 17040 

ccctaatccc tgactgtggt gagatcagat cactggttaa gtacccagaa gggccttggt 17100 

caggggctcc aggggtgggg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 17160 

accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 17220 

gacccccaga gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 17280 

acagcaagga ggccgcccag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 17340 

ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 17 4 00 

cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 17460 

ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 17520 

gctggacgat gagcagaagg ggctgttgct taiataaacgt catttcctta agaggataaa 17580 

acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 17640 

gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 17700 

gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 177 60 

tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 17820* 

acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 17880 

cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 17940 

cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 18000 

caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 18060 

ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 18120 

gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 18180 

agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 18240 

gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 18300 

aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 18360 

cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct 18420 
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cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 18480 

tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 18540 

ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 18600 

atggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 18660 

tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 18720 

aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 18780 

agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 18840 

aatcccagca ctcagggagg ctgaggcaag tggatcacct gagtcagaag ttcgagacca 18900 

gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 18960 

tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 19020 

aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 19080 

acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 19140 

aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 19200 

tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcccccac cgtggcatct 19260 

cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 19320 

agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 19380 

ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 19440 

cctaggctag agggagctca tgctccagca gaattcgagc cagaggaggt aaaagatggg 19500 

taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 19560 

agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 19620 

ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 19680 

ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 19740 

acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800 

acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860 

aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 19920 

tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980 

ggcaggggac agagctgggig cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040 

agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 20100 

cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 20160 

cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220 

aagcttcagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280 

cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340 

gatagctatt aatgccccgt tgagcaattt cctgagagct ttgccaggca ggtaccgcct 20400 

ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 204 60 

gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 20520 

gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 20580 

gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640 

tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700 

tccacagcct gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 207 60 

gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820 

cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 20880 

aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 20940 

atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000 

gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 21060 

gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21120 

cccctacctc ataccatata caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180 

ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 21240- 

aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 21300 

attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 21360 

ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 21420 

aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 21480 

gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 21540 

gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600 

ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660 

ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 21720 

cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 21780 

gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 21840 
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taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900 

caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960 

atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22020 

gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22080 

ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140 

tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200 

actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260 

ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 22320 

acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 22380 

ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440 

tgctttgtaa tagcaatgaa ggctggaaaa actcctcaat caggtaaatg ctgtcaagtg 22500 

cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560 

gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 22620 

ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 22680 

aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740 

acactttggg aggctgaggt gggaggatcg cttgatccca ggagttcaag gccagcctgg 22800 

gcagcaaagc aagacaccca tctcaacgac aaattttaaa aaatcagcca ggcaggctgg 22860 

gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcactt 22920 

gaggtcagga gttcgagacc agcctggcca acgtggcaaa accctgtctc tactaaaaat 22980 

acaaaaatta gctgggcatg gtggcagatg cctgtagtcc cagctactga ggcacaagaa 23040 

tcgcttgaac cagggtggca gaagttacag tgagccgaga tcgtgccacc gcactccatc 23100 

ctgggcgtga gtgagactcc tgtctcaaaa aaaaaaaaaa aaaaaaaaca aggagccagg 23160 

cacggtgggg tgagggaggg cacagaagca gcgcctcttc tgggggcacc cccaatctct 23220 

agcgatccag aggcctcagg atcctgaagg gagaaaaaac gtgaagctcc gtgctagaag 23280 

agaccataga gattggaatc agctggttct attttacaaa aaaaggaaac tgaggccctc 23340 

agaaggtgag tgcctctcaa tgccccacag ggaggcaggg agagggctct gagccctgca 23400 

gggccctgga ttcttgcaat ggggtggagt ggagcctgtg ccgcccccac caggcacctt 23460 
ctcaggagag gagccgttgt catatccttg aaggggtcct tgagcccctc aaaaggctaa * 23520 

aaaccacttt cctccttgag tgaaccttca cctcagttta accacaagaa aaactacatt 23580 

aaggcccagc gcagtggctc atgtctgtaa tcccagcact ttgggaggct gaggtgggtg 23640 

gatcgcttga gcccaggagt tcaagaccag cctgggcaac atagtgaaac cctgtctcta 23700 

caaaaaacaa caaaatcagc tgggcgtggt ggtgcacacc tgaggtccca actacttgcg 23760 

ggctgaggtg agaggattgc ttcagcccag gaggtagagg ctgcagtaag cggtgactga 23820 

atcactgcac tccagcctca gcaacagagc aagactcaaa aaaaaaaaaa aaagcaggcc 23880 

gggtgtggtg gctcacgcct gtaatcccag caccttggga ggccgagcgg gaggatcagg 23940 

agatggagac catcctggct aacacggtga aaccccgtct ctactaaaaa tgcaaaaaat 24000 

tagccgggcg tggtggcggg tgcctgtagt tccagctact caggaggctg aggcaggaga 24060 

aaggcgtgac cctgggaggt ggagcttgca gtgagctgag atcacaccgc tgcactccag 24120 

cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 24180 

acattaaggc aaactaaaag atgtttaaaa tatatatatt aaattaaata cactccaata 24240 

gagcaaatac gaaaataccc agaaaacaca atccccgcac ccccaggaca acctcccagg 24300 

gggtccacag caagagaccc caagcacgag agacagagaa cagtgtccct gtggcggaac 24360 

ctctggccca tcaggctcta ttagaaaata aggctcttgc cactgagaga aagaggcaca 24 420 

gtcgcccagc agccacgggc tctggcacac cacgagtcag gccagcaaag tgtcaactgc 24 480 

cccctacaag gtgacaaact aggacaaact ggaaaccaga ggctggacct ggagcacagg 24540 

gaccaccaca tggggctggg gaatgggcag ggacctcaga gcgccaccca catgcctaag 24 600 

agcagcgcgt atgcgcatgc ctctgcatgg cttagggaca cagggagctc cccccacccc 24 660- 

caacccagga aggcagcccc cactacccag gtagggaacg gataggacca gcaccccgtt 24720 

ctgctcgtaa ctcagggctc caggccccct cgggggcaac cagcacagag ctcagacccc 24780 

aaatatcttc acccacctcc tggtccccat ctggacaagg gtgctgggga ctggctctca 24840 

gtcacaccct cggggtactc ttcaaaggac agctggatgc cccagggcag gagcttttgg 24 900 

cccccagctc cctcacccca gacaccagct cttgggaccc caccagcatg ggcaaggtgg 24 960 

acaccatcgt cccgattttg cagatgagga aactgaggct gagggctggc acacggctct 25020 

ccagagctga agagaatgca gagagcagcc ggagccagcc ggtgggtccc tgaggccggc 25080 

tcgtagcaag ccacagctgc ctccgcccat cacacttgga cctcactggc cccaggacag 25140 

ccctccaggg cggcctggca cagagcccac accctgctgc ttcctgaaca aataagtgaa 25200 

caaggccacc aagccgagga cctggatgta gccccggctc ccgccagggc ctccccaaca 25260 
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gactccccat 
gctgtctcac 
tcagggccca 
ctggctcgtt 
ccaccaggat 
gatcaatcca 
attattcagc 
cagcactttg 
ggccaacatg 
gggcacctgt 
ggcagaagtt 
gactccattt 
actcctgaaa 
cttcatgcaa 
ccaggggatc 
ggggtgatgg 
ccaaatgcct 
aaaaaaaaca 
taactcctgc 
acaccatgat 
gtgggcccac 
gctctgcccg 
actcggtagc 
aactcactga 
gcacacaaag 
tcagaccctg 
gctgcagcgg 
aatccagact 
tgacacacaa 
gcagttaggg 
ggtgcccgaa 
ctccaccccc 
acctgcctac 
gcacggctga 
gaaagtgctg 
gagctaaaca 
ttggttaacc 
atctacagga 
gcacttgggc 
ggaccactag 
tcaggaaaaa 
acacagtaaa 
ttccatttta 
taaaataatc 
ctcctcaggg 
aacgtgcaca 
ccccctttaa 
cggaaaaagg 
ttgttttgct 
ttggggtgaa 
aaactccaag 

tgggggatct 

acagctgttt 
ccacaaacag 
ccatactcca 
ttgcttgaaa 
tgcttaagtt 



ttggagagcg 

ggcttctgta 

ggcccctcct 

cctgaacctc 

gcaggaggca 

atcagcggat 

cacaaaaagg 

ggaggccgag 

gtgaaatccc 

aatgcaagct 

gcagtgagcc 

caaaaaaaaa 

acgttacagt 

aatgcccaga 

cggggagagg 

aaatctctta 

gccttctata 

actcgacacc 

ctaaagcctt 

ctggcctcta 

ctctgcatag 

caggggcggg 

agacaccaaa 

attcatggta 

gtagcatcag 

ggcgcagggg 

tgagtgagtg 

attgtaataa 

ctgcgctcac 

ctccccatag 

gtcaaacagc 

atcagccctg 

atcccccacc 

aattaaacgg 

aacaaatgct 

cacggggccc* 

ccactcagcc 

ggaacctcca 

cccgaggaga 

aacaaagcca 

taattagcgg 

accagggata 

ggcctggatc 

aactctattt 

gctcgaacca 

aaacccaaaa 

aacaaacagt 

agcggcgctg 

agctccagac 

actgtccata 

ctcctcctgt 

gactacaggg 

gccttaagga 

aacaggagcc 

gaaaaacaaa 

aaccacatgg 

tcctgagtcc 



cattaagtgt 
acctgaactt 
ctgagatgcc 
ggcaggagag 
tgagacctgc 
gaactgacaa 
gctgaaatag 
gccggcagct 
gtctctacta 
acttgggagc 
aagatcgcac 
ataaaaggct 
aaccaaggaa 
acaggcagaa 
gaacgggaag 
taacttgata 
ctttaatatt 
tttcacctag 
ccagaagctt 
agggcctctc 
ctgcaaagct 
gccttatctg 
gcaaaacagc 
ggaaggattc 
cttcctaagc 
ctctgtccac 
agtttgtcag 
cagcctctag 
tgcagctctg 
cctgggagag 
cactgagcat 
caacccgtcg 
cgtgtgtgtg 
gttccaaaaa 
ggatcgggtt 
tgctttggag 
cttctgggct 
gggggagagg 
gaggactagg 
tttccgtgcg 
cccacttact 
taacatcaaa 
accacattca 
ttgcctcaca 
catggacgac 
ctgcaatgag 
ttccccaaaa 
gcctgtactc 
aaggccccac 
gaatctctag 
tccccgctgt 
ccaccccgct 
ggcccagaca 
gcggcgtgca 
acacgcacat 
caatctccaa 
tcctgacctc 



ttccaaagcc 
ggccctcact 
agcactgact 
tctcaggcca 
tcgtgccggc 
atataatgtg 
gccgggcgtg 
cacttgaggt 
aaaatacaaa 
ctgaggcagg 
caccgcactc 
gaaacaccca 
gtcagccacg 
ccacagaggc 
tcaccgtgta 
gaagagaggg 
ttatattata 
gaaagatctg 
ccgctgcctt 
gcaggacacc 
tctttccctg 
ccattctatc 
aacagcatta 
tattcccatt 
ctcccagcac 
agtgctaact 
tggactggat 
accggctggg 
ccagggatgg 
gaacggggtg 
gtaaacccag 
ctgcaaggga 
agttctacta 
cgacaggaag 
caccggcgaa 
gggactctct 
ctcccagagg 
gggtgcctgg 
gcttgggagc 
ttcacagctt 
ggcttcgctg 
accgttctgc 
ctggggctcc 
cacaactgaa 
aggcatttga 
gtgaaaggca 
ccccttttgc 
cccagatact 
aatgtaaaca 
gcttgggtca 
cccccacacc 
gcaccattca 
aaggggcccg 
caggcacttg 
ccgaagagaa 
attaaaagaa 
aactccaccc 



tcacaaacca 
ctgccctccc 
ccccaacttg 
gatcctccca 
tgggagatgc 
gtccctccac 
atggctcaca 
caggagttca 
aattagctgg 
agaatcactt 
caacctgggc 
tacgtggtac 
aagacgcatt 
agaaagtcga 
atgggtatgg 
ttgtaaacac 
taagtttcac 
gctttagctt 
gtggatcaca 
ccgagggtga 
tcctcccctc 
gcactcaacc 
taccgggcca 
ttacaggtga 
aggaagcggc 
aactactcct 
gtccaaggtc 
gccagaaaga 
ggctaaaggt 
agataacaga 
gtgggtctga 
gaaagcaact 
aatgcctgag 
cacgaagtga 
tttcttggaa 
cagggtgctc 
gcccggcctt 
acaggccggc 
tggggaagtt 
ccaattgcaa 
aggtccgagg 
agaaagattc 
caggccttgc 
ctctacagct 
ctccagcaac 
acgcggtcgg 
ctccttgacc 
gtcgctgttt 
cgctcctgaa 
gaggcaggag 
tccaagcaga 
cactggaaat 
aggtcctccc 
cggccgtgcc 
tgatttaggt 
catgtgtagc 
cctgggaaac 



cagatgtccg 

agcactcctc 

tccccatcac 

ccagccacct 

aaccaaccaa 

acaatggaat 

cctgtaatcc 

agaccagcct 

gcgtggtggc 

aaacccagga 

aacagagcaa 

tacttggatg 

gtaagattcc 

ctggtgttca 

gttttatttt 

tgtgaatgta 

ctcaatttaa 

gcatttcctg 

accagactcc 

aggagcaccc 

tacatgggaa 

ctagcacttc 

ggtgcacgtt 

gaaaactgag 

caggctggaa 

gcccccgagg 

atacaggaaa 

tcgaggacgc 

ctcacacagg 

aactaggtat 

ccccaaaccc 

cagaggcctc 

cagtgacaca 

atctccccag 

ctgaagaggg 

cacacagcac 

ggccttgggc 

cctggaacaa 

ctcagcactg 

caggaagcaa 

catgtatttc 

ctccctttcc 

tgcctaatgt 

ataattcttt 

atcaccccaa 

cctagaaacc 

caggcatttc 

tgtcttcacc 

agaggcagat 

gacgtgaaac 

ggctgcagcc 

attcagggag 

cgctaaactg 

acttggccag 

agcaagaggc 

gtttcacgac 

accaaaagtt 



25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080- 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 
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ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 28740 

ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 28800 

atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 28860 

caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 28 920 

taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 28980 

cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 29040 

cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 29100 

gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 29160 

cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 29220 

cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 29280 

gggcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcc 29340 

cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 29400 

acgccgggcg cgggaaggta tcgaatcagg 294 30 



<210> 8 

<211> 33769 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> (33739) , (33749) , (33758) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 8 

cttcccctta cactggtcct tcgacccgcc tcggatgaaa actgaatggg tttagcctta 60 

gaggctctcg gtctctaagg gaggtgggtc aggatgccgg ggacagggtc ctcttcctgg 120 

ggcaacgtgg gggaacgagc cacctacccc tccactgaat tgccctgggg tgtgggtacc 180 

gacggctcat tcggtgtcca gggtctgaga tgtgttgaca ggaagaatga aaggggatgg 240 

gagggatggg gcgaaagaag ccacctgcag ccccaggaac tatctggcca gcacaccgtc 300 

acccagcggc ctgagccacc cctgccagag ccaggaggag accctgccaa tgggtcacca 360 

gtgtgcagga actcagaagg tcatcacagt taataccctc catgccccaa tgtgggaaaa 4 20 

caggtttttt cacaacaaac aagataattt ttgttatttt ggcaaaagga ggcagggcag 480 

ccccggacac ctccatccca cctcatcacc cagccgcagg gccccggcca tccctgcaga 540 

cagagtggat gtcacaacct ccctgcaccg aaccaagtgc agctcccagg ccacaggcca 600 

cccaggaaag gtccagtggc ccccggaggc tcccaccgca ggcctcccac cacagccggc 660 

accaacccag gatagctgtg ttctcctggc ttcttttcac acgggtagca gaaagctgag 720 

atccggggaa agctgagatc cagggaaagc tgagaatcgg cctctgctgc ccggacgccc 780 

acccccagct ctgctcccag ctccagggcc tccttctcag gtgcccttac aggaggcaga 840 

gggcttgagc cacctcctgg gcctggggca cgcaggatga acggggtcac ggtgcaggcc 900 

actgtccact gcgcagatcc caaggccata aacagcctgg ccacagtggc ttcccagctg 960 

gcaggcggcc agattatttt tgttgtttag caattgatta agtttctccg ctgcccccag 1020 

gggtaagtgg tggggcaaat gccgcaaccg cagcatttga cccgggatcc tgtgccaagt 1080 

gaccataggg tcacaaagca caagggaagt ggctgggccc gatgctggct ctgctggaac 1140 • 

ctgaggccgg ccactgtcac ctgcacggtg cctgggacct tccagcaagc acagagaagc 1200 

tatggccctc caggagcagc tggcaggcac cttggcctgc agtcaggggc tctgtctgct 1260 

cagctctaaa acaggaaagt cgctgctctg cctggggtca gggcagccag agagtgacca 1320 

agtcagtgcc ggcctcagga agggacctgc aggcgggtcc cttcctctcc catccctcgg 1380 

tgccagccag cccctcctgt ggccccccac tgcctgcctc tgcccccatg ccccaccaca 1440 

acctcaggcc catggctgca tggccactcc ccaggcaggc agtggggatg ggatttcacc 1500 

atgttggcca ggctggtctc gaactcctga cctcaggtga ggagttccta aagtgctggg 1560 

attacaggcg tgagccaccg cgccagccct ccctgtggta ctaaacactc acaccccctt 1620 

gctggggacc ctggtgaggg aacacagcct cacaagtgaa gtgtggtttt gttgagcaaa 1680 

tgacgcctgg gcagccctct catctttgcc taaaactgaa gaatttaggg gcgtggatgt 1740 
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ataaaacagt tggtgactta aatgaaaaag aaggccacac tccccccttt aggcaggcgg 1800 

cctaattctt taaaagccag cacagggtgc ctttctgaac ccaggcacac agtaggtgtt 1860 

caatggacag cagcggttac ttgtactgct catgacaccc tgtctgtggc ctctgcagct 1920 

ggctccagcc tgacgcatgg ctgcgcccct ccgcaaggcc accccggtat acatggaaac 1980 

tctgtggaga aggccttggg ggccggccag gacgccaggc ccagatccca tctgcgccct 2040 

tcctccatag acctcagcga gctctcggca ccatgtgcct caggcccatt taagaagtag 2100 

ggccggccag gcatggtggc tcatgcctgt aatcccagca ctttgggagg cccaaggtgg 2160 

gtggatcacg agatggtcag gagatcgaga ccatcctggc taacacggtg aaaccccatc 2220 

tctactaaaa atacaaaaaa taagccgagt gtggtggcgg gtgcctatag tccaagctac 2280 

tcgggaggct gaggcaggat aatcgcttga gctcagcagg cagaggttgc agttagcgga 2340 

gatcgcgcca ttgcactcca gcctaggtga cagagagaga ctctgtctca attaaaaaaa 2400 

aaaaaaataa aaaaaagaag cagggccagc cacggacgac ccctcacaca gctcccagga 24 60 

cgcgtgcctg ggtatagggc tcaggaccat gaccgctgca gtggccccca agaaacgtta 2520 

cttttgtcac ccaccccgcc tcagtggcag tagccaaaat aacggattag aatggaacca 2580 

tgtgacaatg ccactgcccc aactgacaga agatggctat cagcagttca cgcggcccca 2640 

cctatcacaa gtgcagggca ctctacaact tatgcatcct tccccagaca ccgtcctttc 2700 

gaccctccca ggtcagcaag gcacacaggg cctacatttc acagccacac agcagagggc 2760 

tgaggctgga actcggatgc tctgatttcc gttcaatcac atccccagag gtggcacaga 2820 

gacggggggc ttctcttgac aaagtcaaga aagtcactgc cagctccact gaagaccaaa 2880 

gaacctcagc tctcaaaccc tcttgaaggt gttaccgaac tctcccagcc tgtttcctgg 2940 

gtcccgatgt tggtcccgtg ggacacagga agaggaagaa gctccctaga gcagagcctg 3000 

gtgcacctgc cacactctca gagggctgcg cacgggcgga ggagccgtgt gcaggagtgg 3060 

ggtctggatg gaggggcgct gtggccgggg gcagggggca ggggaagggt gctccaggtg 3120 

gtgggcacag cacgagcagg ggcagggagg tccacactca gatgtgcaca gggagaaaca 3180 

aatcgtgcat ttccattgga ataggcggta aaaggtagaa aaacagagtg ggggccagga 3240 

agggagtcgg agccttctag tgtctctctg caggtgagcg gcagcccgag gtgtcagctc 3300 

agcagacttg gggtccaggg gccgtgtctt ctatcactga ccccagggca cacggaactg 3360 

gggagggaga gcagaggcac agggcacggt cagtgaaacg aaacaaggag tcatcaccaa 3420 

atgcggaaag ggcaaggagt gcccgcagcc gcacaagggt tctgtctggg caacgtgggc 3480 

gtcccaccag gccccgcacc ctgcaagcgc aaagctcgcc actgaagata aagggaagct 3540 

gttggagctg cggagctggt ctggggtccg catggagctg ggcttatgct gcagtcacaa 3600 

gggggacatg gaagaggctg caggggacaa aaccagtgac cacagtctaa ctctgagcct 3660 

gtggaaaggc gcccacagca ttcacccatc ccagagatgc cattccccct gtgcccccgc 3720 

tccacggtga cagcgttctc caggaatatg atgcgcccct ctcctcttgc atcagccctg 3780 

acagtgagta ttcaggccaa aaagcagaag agcacagctg cgtggttcca tttccatgta 3840 

gttctggaac aggcaacgct aatccaaggt gatagaagtc aggagagtgg tggagggggc 3900 

gggggttgag gatggcaaag gggcaccggg aactttccca gtggtagaaa tgttctctgt 3960 

ctggaccgtg tggtagttat gcagacatat gcagctgtca aagttaatcc aaatgtacac 4020 

gttaaaatgt gtgcgtttta ttgcctgcaa gttatacctc aattaaaaaa ataaagttag 4080 

cactcaggct tcttccacaa cttcctgaac cgtgtgagct gattttcttg ctattaaaaa 4140 

ttcacggtcc atggctgaga acagcagctg ccttctgttt gcaaagtcaa cgccaatcac 4 200 

tgcccggccg cggcagactc ggccccacag gacctccttt cttttttccc tttgacctac 4260 

ttccctgata agtgacaaga cagccagact ctgggaacaa acgcccgtta ttcggccccg 4320 

agctgagcgg gccctgcttc ctgagctaat ccgcccggac agacggaggg acgtgagggg 4380 

ctttgccgtc ggctccagct gtcagtctgc ccgtcagact cgacagtggc cccctctgtt 4 4 40 

cctcccgctg cccccactcc atccccgact tctttttgtt tcctgtccct gacagacgaa 4500 

catctgttaa aactctgtct gggtgagctg tggccagcgg cccacaaatc cccaagccgc 4560 - 

accccagcct catctgggcg ctgccgggag cactgcctgg ccaccctctg gacatagctc 4620 

tgagagccac cggccagggc acgtgtggcc cgagtggcat ggtgcacgcc gctaagccca 4680 

ctgcccaaag gcccccaagc aggagggatg tgcaggagac aaaagtcaaa agaacagggg 4 740 

cacgttccac agaggatggg gctggagggg tggcagtgag gaacagcagc ttccgaggat 4800 

ggcggtggca actcccaaat aaggcctcac tcctgctgtt tttagctcat tccacataat 4860 

tggaaaaaca tggcagaaac cgaagccagc tgctgccttg gtcctggggc tgtgtggagg 4 920 

gggtggggag gccggaggcc caggctctgc actcgactgc tggggatgag agtgactctg 4 980 

agctgcagag agcagcatcg cagccgccat ggtcccattg agccccggcc acgctgggcg 5040 

gcagaggctc gtgggatata cctgccctgt ctcatggggg tcacttcagg aggggcgggg 5100 

gagccaggac acagcccagg gctagcggtc accctgcagc tcaggggcca cgtaaatagt 5160 
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gccaccttga aggcacacag cagtgcgggg ccccccccgc caccaacgca tccctacctc 5220 

taggaggccg cctgtgtgcc cctgggaacg ctgctccctg tcccttgggg tcctggtgtg 5280 

accaccctct cagccccttc cttggggaag gcacctgact ccctacaccc agctggcttt 5340 

catttgctca aaatcaggaa aaagcagaat tcaagacatc acagaaatgt cttcgcctgt 5400 

aactccatga aagataaacg gtcagacacc caggagggag tcccagggac ccttgagtct 54 60 

cacctgaggc tctggcttca aacctcgaga tgtttccagc catgctagcg ccgcccccca 5520 

caacctgccc cacacagtcc tcccttggga actcacagat ttggccccca cctgccccgt 5580 

ttcttctggt ggagtgggtg cgttgggttg gggtggggct ggggactctg gatgtgtctt 5640 

aagagtctga gtgattctga cacagccagg ccctgccccc ctcctgacct tcgccccaca 5700 

ggaaagggag ccacacgcct gaagcgccca gcacaccccc ctccgtcctc cccaggtcac 5760 

ccgctggccg tgtgagccgt gctccccact gccccttcac ccaccccagc tcctcctggc 5820 

agcacccagc cttggaagct acttctgatt acaaccgccg aaggaagact cgctccctcg 5880 

gcactgaccc agacagcctg caccatcacg ctgctcagca caacccacac agccttcctc 594 0 

caaaccccat ggagcgggga gtataatcac cccctttcta ccaacggaca aactgaagca 6000 

cagagaggtt aagtcacttt cctaagctcc caacacgatg acaaaaaata gaaggtcagc 6060 

ccgcaagtgg aactaggtgc tccaagtccc cggtctgcct gacactgcac ctcctcgccg 6120 

ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 6180 

gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 6240 

ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 6300 

gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 6360 

ccacggtccc gggtctgcct gacactgcac ctcctcaaca ccaccacggt cccgggtctg 6420 

cctgacactg cacctcctca ccaccaccac agtcccgggt ctgcctgaca ctgcatttcc 6480 

tcatcaccac agtcccgggt ctgcctgaca ctgcatttcc tcatcaccac ggtcccgggt 654 0 

ctgcctgaca ctgcacctcc tcaccgccac ggtcccgggt ctgcctgaca ctgcactttc 6600 

tcaacaccac tccttggccg gctcccaact acaaaccaag ccatgtcttc catcctgaat 6660 

cctcttggcc taaacatcac tcacaatgcc tccctcggga acaggcacaa gtcccaccag 6720. 

cacagcctcc ttcgttacct gcgtttccgc tagcccaggg ccagctccag agccctcacc 6780 

acagagcctc tatccttcac ccccggacac tggacctcac caacccatag cctggaggag 6840 

atccctgtgt gaccccaggg cctcctctgc ccgactctga atttcactgc ccaacgtgac 6900 

acctcggaag gctctctggg cactggcagc cctccatggg caccgctcct tctggccagc 6960 

tctgacatcc cggctggtga ggtgccctgc acgaggcctc tgcccactgg gacctcacag 7020 

ccgtgctgtc agctgcaaca agcgacagaa tttcacgttt tcttcacgtt gcccctgggt 7080 

gagcagctcc aggtagtttt cagtcgaggc gaggcgtccc gtcagcagcc aggcggcaca 7140 

gctaattcat gcccgccggg cgcacggccg caataccaat gggcacctgc agcctggaaa 7200 

gccacagagg aaccgagaac agcgactgtg ctcaggtgac aggactgtgg tcttttaaca 7260 

aaacattttc ctttaacgtg atattttacg gcaaggaatg aaacctggag ggcaggacat 7320 

ttggatacta aagccccagg ctgccgcgtg gtctgctttg tgaagtctga agcccgcgcc 7380 

ccattctggc cccgctcaca ggtccggctc tgactcacca gcttcaatgc taggccgtgc 74 4 0 

ctgtcctcca accagaacat gacttcctta aggacaaagc cgtttctcgc ccatccccat 7500 

ctccctctgg attaagaaat atgggaagat cttctagaac cacctcaaat ttgcagagag 7560 

ccatcctggt gacaaaccct tgaaatgctt ctaagaagag tttaggtttc ttctcaactc 7620 

taaaacctct agaaaactct atttccacac cagctgcccc tggaacactt cagcttcaaa 7680 

agggcccagg gcagggagac ggaggagcca gcatccacac cgagcaccag cctgttaatt 7740 

aacgggaagc gggtggggcc catctccagg cagctctgag gtcagactgg ggaaccatgc 7800 

ttacaaaaaa aagtgaactg aaacgctcac gtcctcatgc aaaaccagac tcccagttgc 78 60 

atctttctgt ctcattgagg agctttttcc tccctttgac agaacaccct acacacggca 7920 

tctggaacca aagcagaaag attcaggctc agagtaaaac agtccccaca ctggctgcat 7980 • 

gtggacgttc ccggcccaga gtctcgccca agcagggcct ataaatgaca caaaatgttt 8040 

ttctcctgcg tgccagtcat gctccaactg agttatgtgt aaaagtgcct ctcacggctg 8100 

agggcaaaaa cagttcccac aagactagag aaaggtgacc cctgacggct gagtctctag 8160 

ggagcgtgga gctgcgtgct cagccctgcg gccctgacgg ctctggaatg gaaaagctat 8220 

ccaactggaa gggcagggct cgctgctagt ccagcggtcc aaccccacag gtgtctgtgg 8280 

tgtcagctcc atgccacaga gcccagggct ggggccagag ccaccaggcc ccctgccagc 8340 

ctgcaggggc ctcctcctct gggtagccta accaccccct gtgagcgcag gcagcctcct 8400 

ctaatcacca cagggcctgt ccccccctct cccccgcttg caggaaaatg agccctgagg 84 60 

actccccagg gctgctctgg gcctggacat ggagactggg aattacattt gcagaaggag 8520 

cgcaatgccc ttgaagggct cagccacgag cagccagtcc ccagggctca gaaggcccag 8580 
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ctgttagaac 
cctcttctgg 
ccccaggtat 
cccagctctc 
atgaaggtga 
tgccaggcac 
ggtagcctgg 
gccatgagtc 
gagccacaca 
cccaaatagc 
agagacaggg 
ttccactgtt 
gtgtccattc 
aacagagatg 
agcagatgca 
gccctgctcc 
aacccacagc 
gcgggaggat 
agtggctcat 
ttaggagttc 
aaattagcca 
gagaattgct 
aacagagcaa 
atgtctttga 
aataaatagc 
aaaaacgcag 
ccaagagagt 
cccgttacag 
aaggtaggaa 
cccgtgaagt 
ccttctcgcc 
acctgccggc 
gttccgtacc 
tggcccccac 
cttctcagat 
tcagactagc 
tgttgcccag 
gctcaagcaa 
actgcccagc 
ggtcttgaac 
acaggcgtga 
gtgattctct 
gagcccgtat 
cagggtccag 
aacaaactct 
tgaggcttgt 
ctgcgaggga 
ctcatggctc 
aaccccaggc 
tcaaaccact 
cctggtgtcg 
tctctctcaa 
tgatggagga 
ctcacttgtg 
gcccgggatt 
caagtgttcg 
agcagatgaa 



cctgggagcc 
ttgggaaaga 
gggaggattc 
acgggcctgg 
taagggcatg 
ccagcacggc 
gcaagttatt 
tgggatttta 
gggaccctta 
tgggattaca 
tttcaccatg 
agggcaaggc 
cccactcctt 
gtgacacctg 
acacagtgtt 
ctctaactcc 
agcaagactc 
gactaaaaga 
gcctgtgatc 
aagaccagcc 
gacatagtag 
tgaacctggg 
gactctgtct 
tacatattta 
aggaagcctt 
ggagggctgg 
aagagaccac 
aatttgcggg 
tttcaggtca 
cggtcccaac 
tttccctctt 
tcaggcggcc 
cgatccaggt 
cacacgtgtg 
attttctgga 
caggacgagc 
gctggagtat 
ttctcctgcc 
taatttttat 
tcctgacctc 
gccactgcgc 
ctcccctccg 
gtcctcactc 
ctccagcact 
gagaaggcca 
gcagccgagg 
gaccagggcc 
tgggagccag 
tggtgacttg 
cctccccttc 
ccaggtcact 
cattgccttt 
gcgggtgtcc 
ccgtctcccc 
ctgtctgcac 
acaaatgcct 
tgaatggggt 



agcaaagagc 
aatcaacgcc 
tgggacgatg 
caccagccac 
gacagtggac 
tccgtcaccc 
tgggggtctc 
tcagggccca 
acaaaagctc 
ggcgcacgac 
ttggccaggc 
acctgacagg 
cgacaaatgt 
cttcctacat 
taggcagaca 
agtggccagg 
agaaatggca 
catgtcttga 
ctagaacttt 
tagccaacat 
cgggcgcctg 
aggtggaagc 
caaaaaaaaa 
cctcctgcaa 
acggttcgat 
ggaacaaaaa 
ctccagacct 
gaccgctgca 
acaacagagc 
accagcatgc 
ttagtccttc 
acacccccag 
ctgtcctatg 
cctcggggca 
atatttgtgg 
tgcatttttt 
agcggcatga 
tcagtctccc 
atttttagta 
aaatgatcca 
ccggcccgag 
tcccccggcc 
tttccctttc 
gagttcagcc 
aggttcccgg 
caccccctcc 
cttgctccac 
accccacctg 
ccctggcctc 
aagggtctat 
caggaggcag 
gaggccgagg 
aagacactca 
cacccaccca 
cgcggggtcc 
gctgaatgag 
gctgtccaga 



caggggctcc 
cctttactgg 
caggcaaacc 
agcacctaag 
atggcagctg 
ctggatgagc 
caagcttgtc 
cacccgttcc 
ccagggcaac 
taccgcccgg 
tggtcttgaa 
cacgactgca 
ccacacccag 
gcccattgct 
tcaccaatcg 
ccccaagcca 
aaaacacaaa 
taagatattg 
aggaggctga 
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tttctcaaat tctccttctc ccgacttggg aaacaagatg cttggtcggg gaggctctcc 3L2O60 

aaccatcccc tgcagcagcc ggcacagcgg acagaccctt tgatgtaaca gccatgtctt i2120 

cattaaagat gccctgctct cagaaagaga aagacaaata caaacctgga aaatcctcac 12180 

caaacgcagg acccctgcca gggagcagag aaaagaccca cacgccacgg gcgccacgac 12240 

cacacacaca ccccagccgc tgcacacaaa cacagaccct agccagcaag aacaggggga 12300 

ccaggaaact gttcctaaag tcaggacccc catgtgctca gacagcagtg agagcaagga 12360 

cacttctcca tccaccggat gccaggagag tccttttagg gggccccaca ccgagactct 12420 

gcccttagga ctgttcctga gtgtggaagc cagcccactt ggaagccccc tgccctcccg 12480 

agtgggacac cggcacagga agcaggccct gtcccccacc actttctgca agctgggccc 12540 

catcacgcta cagaaacggg gaggactggt cccagggatg gcgctttcct gacacctctc 12600 

gttaccccct cgcttgccag gccccagggt cagccccaga ggccagactg gctatcccag 12660 

gcccgggagc atccccgaag gcgagctgca tcctgaacgt gtgtgatttc ccgaagggcc 12720 

cgccccgaac cgacacctgg aaagaaagat cctcagccgg tgccccagag gagaagagcc 12780 

atgcctcact gcaacacagt cccaggaagc accaagtgcc tgaggaccaa ggcggagagt 12840 

aaaaaagtgg aaaatatctg gggcaaaaat aaaacaaaac aaaacaggat tgacctcctg 12900 

ggctcaagca atcctcccaa ctcagcttcc cgagtagctg ggaccacaga cttgaatcac 12960 

cacacccgcc aagtggatca tttcgaacgg gtttgccgag gttccttctg gggcaccccc 13020 

ggcggccgca acccattccc gccaggcccc gccccgcccg cccgccccgt cccgtcccac 13080 

cgcctcacct gccttacacg tcctgccgtt gtcctgcagc tgcacacccg tggggcaggc 1314 0 

gcatgtgtag aaaggctcgc ttggggacag caggcacagg tgggagcagc cgccattgtc 13200 

ctcctcacag cgagtgtgga ctgagaaaac caggacagac tgagagaagg ttccagaaga 13260 

ggaccgtcac ttgtttctga atgagtcaca tcctgcctcg tcccccgtga cagcctccag 13320 

tgtgtccctc tgcccaaaca tcggcctcaa gtggcatcag ggacctcccc gcgggcacca 13380 

ttccacctgc ctcatcgctg gccccgtcca catggggccc tcagcctggc cagacggcct 13440 

gcaatttccc caaaaccagc cgtgaccttc ctggccaccc tcacacccag atgtgacctg 13500 

cccatggagt gacatcctcc ccatctgctt cctcccacca agctcctatg actagaacac 13560 

cctccccagc tcctcggagc ccccaaagga cacccctctg caaaggctgc cccccacgct 13620 

ccaatggccg gggtcaggac ctgcctgtgt ggtagtgacg ggaaccccag agacaatggg 13680 

ctcctgggca aaaggcttgt cttgtctttg tgctatgtgt ggacccagca gcttccatag 13740 

gaacactgtc cttcttgctg ggatggccaa gcttgtcact ctcccaagcc ctcctatgac 13800 

caacagcaat tgaacggaac tcgataaatg cttccagcac ctcattcaaa ccaggggaaa 13860 

gctgggtgta gcagccccaa aatacggata taactggaac aacaaactca tcaaaatgaa 13920 

cctctccctc cctcatgctg ccccaagtgt agatgggttt tgtgaccacg actttctcac 13980 

caggaaacag ctccagagag ccccaccctc ctgtgtcctg ctctgggaac agctggcacc 14040 

cctaggcccc acatttcaat tcaaagtcca aaccttccat aatggcctgg ccagaaatct 14100 

ccatccctgg tccctgtggg agtgggccac tgtccccaga gccgcagccc cactgtcaca 14160 

gaagctggtg catttcccca tcagggacct ctgtcacaac ccagcgtggc ccccaggctg 14220 

agaactgctg attctgggca gattattcat tgataaatac gcgacttgca gggccaagca 14280 

tggtggctca tacctgtgac cccagcactt tgggaagtca aggtgtgagg atcactggag 14 340 

cccacgagtt tgagacaagc ctgggcaacg tggcaaaatc tctcatctct attaaaaata 14 400 

catacacaca cacacacaca cacacacaca cacatatata tgtatatata aataaccata 14 4 60 

tatatatata cacacatacg tgtatgtgta tataaataca tatacacaca cacacagaca 14 520 

acttcttctg ggccttgaaa acgaggcaac cttccttgga aatccccttg ccactgctga 14 580 

gcctgaaata gcccccatga gctctgcaga ggggtcctct gcaggcccgt gtcccccagc 14 64 0 

cagccacaca cctccctcca ttgcagcagg taccccttta gagagggggc cccccagagc 14 700 

atgggcttct gcagggaggg gtcacctgcc cccccacccc acccacgccc gcgcaccccc 14 760 

acgcccccgc atcctcccac tcccctgccc cgcgcccccg ctccccccag ccccctcacc 14 820- 

ctctcccccg tgccccaacc ggcactcaca aaaaggctgc cgctcctggc tcagcacctg 14 880 

gatgtccatg ggtgagtata gggcactcag gatctccttc ctcttccccc cagtgcgctt 14 940 

gttgcaggca tggatggagc gggtctgcca gtctgtccag tacagagtgt ccccggagag 15000 

cgtcagggcg aaggggtgcg tcaggctgcc ctccaccacc ttctgcctgc agtcagggaa 15060 

gcggggtgga ggagccatca ggagggtccc ccgacagtca ttgctgctga cccaattaat 15120 

ttcttttttt ttttttgaga tggagtctcg gtctgtcgcc caggctggag tgcagtgatg 15180 

taatctcagc tcactgcaac ctccgcctcc cgggttcaag caattatcct gcctcagcct 1524 0 

cccgagtagc tgggatcact gatgcccacc actacgccca gatgattttt gtatttttag 15300 

tagagacagg gtttcatcat gttggcaagg ctggtctcga actcctgacc tcaggtgatc 15360 

cacccacctc agcctctcaa agcgctggga ttacaggcgt gcgccaccat gccaggcttc 15420 
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ccatttgctt tcaaccagac aagtgaggcc aggtcaagag ccccaggagc tggcgccctc 154 80 

gtacatttct cccggcgtgc acagggcacc tcccaaacac agcctgtgat ggtgacacac 15540 

gggctccccc aggtcaagtg gcaaagtctc ccccagggaa gaaaggagga agccatgcct 15600 

ggcaaaaagc acacctctcc tgcccaacgc tttaacctct gtatacaaat caggccatgt 15660 

gcactcgctc cttcttacaa tgctcataat ttatactttc agagtaaatg aaacttggca 15720 

tcaacccgag aaacagctat tcttttctag atgcttacag tgcccagcaa atgaggactc 15780 

gggtgtaatg agattatgga cactggaaac aggatcataa tgtgacgtgg tcggtaatgt 15840 

gcagttttat ttgcttaatg accctcgccc cgtgacaggc tccctgaggg tgggcctggg 15900 

ggcagaggtc cccgccacgt ccccagccct cagcacagtt gccaggagag ggtgacactc 15960 

atgaagtggc acagggaaga tgggagctgt gggctctgca gatccaccac ctcttctgtt 16020 

catttttgtt gatgctgttt tttaagaaaa ttattgaagt aaaattcaca ggacatacgt 16080 

ttactttttt tttttttttt ggagatgggg tctcactctg tcacccaggt tggagtgcag 16140 

tggtgtgatc tcagctcact gcaacctctg cctcccaggt tcaagcgatt ctcccacctc 16200 

cgcctccaga gtagctggga ccacaggcgt gcaccaccac acccagctaa tttttggggg 16260 

gtatcttttt ggtagagaca gggtttcgcc atgttgccca aggctggtct tgaagccctg 16320 

agctcaggcg atccacccgc cttggcctct caaagtgctg ggattacagg cataagccac 16380 

tgcacccagc ctaaatttac cactttaaag tgaatagtgt tacctagtgc attcgcaagg 164 40 

cggtgcagcc tccacttctg tctagttcca aagcacttcc attgccccac aggcaaaccc 16500 

cacacccggc agcagtcatg ccccagtccc cgcccccagc cccggcaaac acttttgatg 16560 

gacttaacta cacacattct caacatctca tataaacgga atcacaatat acagcctctg 16620 

atgtctgtct tctttgactt ggcaccatgt tttcgaggtt catccaggct gtagcatgtc 16680 

agtgcttcat cccgttttag gggtgaacca tattccagtg tgcagacaga aaccaatctg 16740 

tgcatccatt cacccactgg gggacctttg tgtcatttcc accctcggct gttgtgcaca 16800 

gtgctgctac ggacattact gtccattcac attttgtgtg aagacctgtt ttcgattctt 16860 

aagagtatac agctaggagc ggaattgctg ggtcatacgt aaatcaatgt ttacgtctca 16920 

aggaatcaac aaactgtttt ccacaatgtt gtcttttttg tttgttttct gagacagggt 16980 

cttgctctgt cacccaggct ggagtgcggt ggtgtgatca tggctcactg cagcctcaat 17040 

ctcctaagct caatccatcc tcctgcctca gcctcctgag tagctgggaa cacaggtatg 17100 

taccaccatg gccagctaat tttctaattt tatttttttt tgtttttgtt tttttgagac 17160 

agagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctcagct cactgcaagc 17220 

tctgcctccc gggttcacac cattctcctg cctcagcctc ccgagtggct gggactatag 17280 

tcaccggcca ccacgcctgg ctaatttttt tgtattttta gtagagatgg ggtttcaccg 17340 

tgttacccag gatggtctcg atctcctaac ttcatgatcc acctgccttg gcctcccaaa 17400 

gttctgggat tacaggcgtg agccaccacg cccgacctta cttttaattt tttaatttta 174 60 

ttattttatt ttattttttt tttttttgag acagagtctc gctctgtagc ccaggctgga 17520 

gtgcagtggc gggatctcag ctcactgcaa gctccacctc ccaggttcac gccattctcc 17580 

tgcctcagcc tcccgagtag ctgggactac aggtgcccac cacgatgccc ggctaatttt 17640 

ttgtattttt agtagagaca gggtttcact gtgttagcca ggatgatctc aatctcctga 17700 

cctcgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 17760 

gcccagcctt tttttttttt tttttttttt ttttgagata gagtcttgct ctgtcgccca 17820 

ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 17880 

attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 17940 

taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 18000 

ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 18060 

ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 18120 

cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 18180 

gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 18240- 

gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 18300 

atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 18360 

agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 184 20 

ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 18480 

ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 18540 . 

taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 18600 

aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 18660 

gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 18720 

cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 18780 

ttcaagacca gcctggccaa catggtgaaa ccccgtccct actaaaaaaa tacaacaatt 18840 
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tgccaggcgt ggtggcgggc acctgtaatc ccagctactt gggaggctga ggcaggagaa 18900 

tagttagaac ttgggaggtg gaggttgtag ttagccaaga tggtgccgct gcactccagc 18960 

cagggggaca gagcaagact ccatcccaaa aaaaaaaaaa acaaacaaac aaacaaaaaa 19020 

agaggtcaag gctgcagtga accatgattg tgccaatgca ctccagcctg ggtgacaaag 19080 

tgagaccctg cctcaaaaca ataaaaatat aaataaaaat aaaacataat agcaaacgtt 19140 

tcatagaggt ggtatgagca ttaaatgaac tgataaacgt ccctggaaaa cagtaagtgc 19200 

tatggaagga ttcgctgccg ccaccgccac caccattagc atgtttcaac ctccatcacc 19260 

ctcactgtcc cctgtcacca tcctttgacc agggcactcc cagctgcagc ctttctatcc 19320 

tcttgtccac ccttcataac tgtaagatca ctcagctccc aagaaccaca gtctacaggg 19380 

taaccacatt tccaaatctc aaaccagacc cgctggtctg cacttccagg gacaacagga 19440 

tattttcaaa ccagcccaaa agagatgtgt ggctcagcat aagaggaaca ggagaaactg 19500 

aggcctcttg ccctgagaat gagcttggaa gtggatgtcc cggcctcact caaaccttca 19560 

gatgactgag gcccagccag gagcttgagt gtaccctcag gtcataccct gagccagaag 19620 

cacccagcta atccactcct catcactgac tccctcccca taaaaaacct gtttgctgtt 19680 

tcaggctgtt aagttgtggg ctgttttgtt acacagcaat ggataactaa cacacgaggc 19740 

ctggcaagtg tggagcaaag ctgcccaagc cctcaagtct gttcatgtgg gtgttggcct 19800 

gtgtttgcag aaatccagcc actgagtcct cccatgcagt cactactgcc ctctgcacag 19860 

acacctgcca catccctgcc tgggccagga gctccactag tgcaggaatg gggtctgccg 19920 

tcccaggagg atccctgaca cctagcacag ggctagcagc aggcagcact tggttagtga 19980 

ataaactgcc cttcacctgt acacagaagg gatgtttcta taaggggtaa ttaagtacag 20040 

agctgggaag ctatgctgac cagaaggctc taaaagcaat taaccaacga ggggaaaacc 20100 

cttcctactc attctcggcc cattttattg agcactgacc atgtggaagg ccccctggtg 20160 

agactgggga atgcaccaat aactgagaca gcttccggct gttgccctca ggatgcctga 20220 

gctgggatag ggccagggtg ggggtggtgc gtgtgacagg gttactgttc acaaccctgc 20280 

cgggccataa gccctcccca acaattccaa aatccaaaac gctctgaaga tggaaagctt 20340 

ttgttgctca tctggtgaca aaacctcatt tggtgcatgg gccgggtgcg gtggctcacg 20400 

cctgtaatcc cagcactctg ggagccgagg ggaaggatcc cttgagctta ggagtttgag 204 60 

accagcctga gcaacatgtg agaccccgtc tctaccaaaa atacaaaaat tagccaggtg 20520 

tggtggcgca ctcctgtagt cccagctact cgggaggctg aggcgggagg atcgcttgag 20580 

cctgggaggt gggggctgca gtgagctgag attatgacat tgcactccag cctgggtgaa 20640 

agagtgagac tctgtctcaa aaaaacaaag ttaaaaaaaa aaaaactgtg catgggtgtg 20700 

ggctacagat agtcttttct gccctactta gaatgaacgt gccacatttg ctatagaaat 20760 

attcaagggc tggtggcaaa tgccacacag accctgacgc tgttccaagt tctgagaagt 20820 

cctgcattcc tcagggcccc agagtttcag agaagagtct gtaggcctga gttaagaagg 20880 

aacgccttca aaagccctgg ggacaaaggg gaaaggggtg ccccaggact gcgtgggtac 2094 0 

ctaccggaac gagccgtcca ggttggcacg gtggatgaag ctgagcttgg cgtcagccca 21000 

gtagagcttc tgctcctcca ggtcgatggt cagtccattg ggccagtaaa tgtccgagtc 21060 

cacaatgatc ttccgggtgc tgccatccat ccctgcccgc tcaatccggg gcgtctcacc 21120 

ccagtctgtc cagtacatgt acctgtgacg ggggcagggc aagagaagca gctaacacag 21180 

atctgttttt tgtttttgtc tgcatagatg cagacatgaa acaacagaca gtgaacttgc 21240 

cctaaaatct cacccatcgg aaataaccaa caggtatggt ttcaggtatt cctgccttaa 21300 

gctgggcaat caaaatatac tatttccaac ttgttctcag ttaacagtaa attctgggca 21360 

ccttcccttc ttgtggatag aaagattcct tgttcttttg atgattgcct agtgtactct 21420 

gctgtaagtt ttttaaagaa cttcaggtta tttctgattt ttttgctacc atgaaaatgc 21480 

tgtaaatgaa cctctaaaag gcaattcaaa acactcagga tggaatatta tttagtggta 21540 

taaagaaatg agctatcggc tgggcccagt ggctcacacc tctaatccca gcactttggg 21600 

aggccaaggc gggtggatca cgaggtcggg agatcaagac catcctggct aacacagtga 21660- 

aaccccgtcc ctactaaaaa tacaaaacat tagccaggcg tggtagtgag cacctgtagt 21720 

cccagctact taggaggctg aggcaggaga atcatttgaa cccgggaggg ggaggttgca 21780 

gtgagcagaa atcgcaccat tgcactccat cctgggcgac agagcgagac tccatctcaa 21840 

aaaaaaaaaa aagaaaagaa aagaaatgat ctatcaagcc atgaaaagac atggaggaaa 21900 

cttaaatgca tgttagtagg tgaaagagcc aatctgtatg agtccagttc taaacactct 21960 

ggaaaaagca aatacacaga gacagtaaag catcagtggt tgccaggagt tggagaggag 22020 

agggatgaat gagtggagca cagaaaatca gggcagtgga actatcctgt atgacatgga 22080 

atggtgggtg catgtcctta ctcatctgtc taaaccaaga atgtacaaat caagggcgaa 22140 

ccctcgtgta aacgtggatt ttgggtgatg gtgcgtcagc cagctttcat cagttgtaac 22200 

aaatgtacca ccctgcacag gatgctgaca gttgggaagg ctgtgtgggt gtgaggacag 22260 
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ggatgtatag gaactcagta cctgctgctc atcaattttg ctgtgaacct acaactgttt 22320 

gaaaaaatta agtctattta aaaacaacaa aacatggcca ggcacgatgg cttgcacctg 22380 

taattccagt acttcgggag gctgaggtgg gtgggtcact tgagccaccc tgggcaacat 224 40 

ggcaaaatcc cacctctaca aaaaataaaa attaaaaaaa agttagctgg gcatggtggc 22500 

acactcttgt agtcccagct acttgggagg ctgacgtggg aggatccctt cagccctggg 22560 

aggtcgaggc tgcagtgagc tgtgactgta ccactgcact ccagcctgga tgacagagtg 22620 

agaccctgcc taaaaaaaaa aaaaaaaagg ctgggtgcgg tggctcatgc ctgtaattcc 22680 

agcgctttgg gaggccgaga tgggcggatc acgaggtcag gagatcgaga ccatcctggc 22740 

taacacggtg aaaccccgtc tctactaaaa gtacaaaaaa aaaaattagc cgggcatggt 22800 

ggcggacacc tgtagtcaca gctactcggg aggctgaggc aggagaatgg cgtgaacccg 22860 

ggaggcggag cttgcagtga gccaagatca caccactgca ctctcagcct gggagacagc 22920 

aacactccgt ctcaaaaaaa aaagaataaa acccatggct gggatggacc ctgaacctgc 22980 

agctgcagct gttcctgggt aggtctgtgg gcgacgtggc tttgcttctc catgttccca 23040 

agagacaagc atcacccatc catgagaaac aagcacatcc tcagggcgcc cttacgtgat 23100 

ctctggccaa tgaaccaaga caaagtgagc agacaccagg tctgggatgg caggtcccac 23160 

ccccaccagt gcccagtgtg ccctgtttgg aggtgaccac agggtgtgtg cccagaggct 23220 

gggcgtgact ctcagcggag accagagggg aaccacacca gcttggagga ctcagttccc 23280 

atcccagcca gctgggatga gccacaggac acaagggctg gcagacctat tgtgttttgt 23340 

ccacccttca cagcagagaa aggggacagt gcccagaatg tcctctgagg agcctcctcc 23400 

cactcttggt ccttgtaaaa tggtgctgac tcccttgctc ccttcttcct ggggtgggcg 234 60 

gcaaacccca ttcccctcag ccttagcaag tgatttagaa acaggcagct cgcccaagcc 23520 

aggcatgaga gtgatcccgg gacacaggga gaacaagccc cgctttgccc tctgggggtc 23580 

tccattcagc agaagaggca aatgacagac acacagccgc ctcctccccc accatggtgc 23640 

tctgcagcct caggagcctc aggtgcacca agggccaccc catccagggg gccatgcttc 23700 

cttgagtggt atcgttcctg agcgagtacc atctccacct tccagagggg ctgtgacaag 23760 

atcaacaaga atgagggcat aggagcctcg aaccaaacat gccctcttcc ctgcagaggc 23820 

tgactgcgcc cagctgctat caccaagccc ctgctcctcc ggccccgtgg ggacagggta 23880 

agaggggtgt cacatggaac agctctccaa acagtccctc tcaagctgct gtctcctgtg 23940 

catctagtga gaacccaacc aacaaaggga aggtgggaat tgctattccc attaggcaga 24000 

tgagaaaact gaggccccga aaggctggcc tgttccaggt tacaggcgct gagcggctgc 24060 

tctgggaaca cacttggtgt ctgctgaggg cccgagcccg gccatcatat gactcaccct 24120 

tcgccagcaa agcccgggtg tgggtgaact tttcctggca gcctgggact ccaaggtgct 24180 

ggcagccagc ccagggaagg ctcccgcgtg cctgcggcag acgccttgct ttacctgcac 24240 

gtccccaccc ctaggagcct ggacagagcc cagaccctcc gccacctcct gagaaggtat 24300 

caggggcatc agtctggact tgggggggaa tccacacagg ccttccccaa atgctccacc 24360 

gtggcccatg gaaaaggctg gaaaacgtgc aggagcagga gcctccgcat ggagcataat 24420 

tcacattcct tccccgagtt tcataacaga ggcctgctgg tttccttaaa tggggaattt 24480 

gcgagccagt cggtgaccag agactggttg gcgtggacgt gctcttgcag agtctcaaac 24540 

gctaccacaa gcccagccaa attccacgga ggaaaatcga cttccgaaga aaagagctgc 24 600 

agcatggcct tcgtgcagag ccagctgcgg ttgtggttgt gtgttatttt agggaagggc 24660 

cattttgcat tttaaagagg gggttgggtt tcaccctggc tttaatttga gacccggggg 24720 

ccactgcagc cccttgtcag gctggtacag gccggggact cctcccatgc taagccagtg 24780 

tctttctggc cccagatcct caggggccag agggtcatcc ccagagcccg ctctgccacc 24840 

cacatgggta ccctgggcct gggagggatg tgccttccct caaccctgcc tggatgtccg 24900 

cacggggcca cctgcattgc tgaaactgca acgaagtcga gtctcaggag gggcccccct 24960 

ggctgcaggg ctcttgatcc ttttggccac gtgcacactg aggtggacgc tcggacccag 25020 

agaccccctt catgatgatg gccggggcag gaaccccctc ctctgaggaa ggaccctggt 25080- 

gggggacagc actgcaggag ggcacaggag atgacggggg ctctagcagg gccgggagga 25140 

aggccaagat gctcctcgca accgtgtgcc tgtggccagg acagaggaca aacccaccct 25200 

ccactgtccc cactctcagg acagcagtcc tgccccagga ctcagcgccc acacttatgc 25260 

ctgaggacca ctattcaagt cagtatttgg cgagcagggg ttgctgccgc gggcgctgtg 25320 

acaggctgga atcctctccc tctccctctc cctctccgga gacatggagc ctacagggac 25380 

agagtcagca cctcagggta ggaccatggc tggcgtcatc agcatcactg gatctgatga 25440 

gtgggagccg gcatctcact gttttcactc tctcattcaa atgactggag caaagggaag 25500 

gtgtggggag aggcccagga atcaacacta aggtcaactt tgcccccagg ggcaggggtg 25560 

ggagtgaaca gccacaggtg tgatcctggg gagggcttct gggagagaat tcagaggcaa 25620 

gcatgtagag gaaccatttc aaatagttaa gaaaagccag agccaaacag ggacagttgg 25680 
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ctcgcagaga tgatgcaggc aaagccagct cagatctgag catgggaaag actactccca 25740 

accaagggcc cagcatctcc caaccaagca ccaagtacct cccaaccaaa tgccaagcac 25800 

ctcccaatca aatacctccc aaccaagcac ctagcacctc tcaactggac accaactact 25860 

cccaaccagg caccaagtac ctcccaacca agtgccaagc acctcccaac caagtaccaa 25920 

ttacctccca accaagcgcc tagcacctcc caactgagca tcatgcacct cccaacagag 25980 

catctagcac ctcccaactg atcacctccc aacctagcac cgagcacctc ccaaccaagt 26040 

gcagagcacc tcccaaccaa gtgccaagca cctcccaatc aaatacctcc caaccaagca 26100 

cctagcacct ctcaactgga caccaacaac tcccaaccaa gcgccaagca cctcctaaca 26160 

aagtaccaat caccttccaa ccgagcacct agcacctccc aactgagcat catgcacctc 26220 

ccaacaaatc acctagcacc tcccgactga tcacctccca acctagcact gagcacttcc 26280 

caaccaacat agcaaaagcc ataaagaagt aaaaagacaa aaccacgtag gcatggagac 26340 

tggacttctg gtggcgagga aagggcattt ttattataac gacagctaac atttgttgaa 26400 

ctcacaaact gttcttggtg ttttcctcat gacatgcagc atggtcacgc .ctctgtacag 264 60 

acaaggatac tgaggcacag agtggcaccg tgccaacctt gtctcatctt tttatcgaac 26520 

ctacatgcag agtgccagca aatccagctg tcttttctct tcagaacaga tcccaaatct 26580 

cgccactcct tacccccaca agtgaggtgt ccccgctgct gctttctgtc gccaggatcc 26640 

cggtaataac cgtggagagg gctcctgccc ccacgccacc caccccacag ctcactctcg 26700 

ctccagccac caggggatgc cttccagcac gagtcagagc tggcacctcc tctgctcgag 267 60 

acctcatgtg tcctctcctc acaccttggg ccctgtttcc ctacattctg ctacagcccc 26820 

tcaaacaggc cccgccccaa accagcccag ggcctttgca ctggctgatc cctctgcctg 26880 

gaccgcgctg cccccagaca gccacacggt tctcagcctc atctgcttcc agtctcgact 26940 

caaaagtcac caagaggcct tcccagcacc tgagctccga cggaagcccc tcgccacagc 27000 

acccaagcac tgctttatcc ccctacgcac acgtcccttt caaatactat tcatttacca 27060 

tctcctccca ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg, 27120 

accaggcgca agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 27180 

cgacccacgc tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 27240 

ccatggcagc cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 27300 

gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 27360 

ccagccgacc cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 27420 

ggggagaggg ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 27480 

gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 27540 

aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 27600 

tcaggtagtg ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 27 660 

gataaattcc accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 27720 

gatgcagaca ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 27780 

cacatgcgca cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 27840 

gacacacaca ggctcacacg tatgcacgca tatgcgtgca cacgcacaca cacacacaca 27900 

cgctcacatc ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 27960 

acaaacctgt tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 28020 

tggcttccta agtcaccaca cagatcaaag aggtgaactt ggccacatgg cactctgctt 28080 

cctgagctcc caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 28140 

actaccctgg gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 28200 

cccggctcct gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 28260 

aagttggtgg ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 28320 

ctgtcccaca gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 28380 

caggcgggct cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 28440 

tagaatatct aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 28500- 

ggccgaggag ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 28560 

aaaccctgtg tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 28620 

cacagctact caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 28680 

9tgagctgag a'tcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 28740 

tccaggctgg gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 28800 

tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 28860 

accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 28920 

acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 28980 

gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 29040 

gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 29100 



Page 4 6 



WO 02/092015 



PCT/US02/15982 



032796-132. ST25 



ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 29160 

tgagccttca agggcccagg tgaactcagc cagactagga gctgaggagg acacagggca 29220 

gcttccagaa cggacccgag aaccactccc agcaggttct gcttccagac aaggagctgc 29280 

actttttcag ccaatgcaat tagaaagcca ggagaaggtg caaattccac ctgcctgagc 29340 

gtccgcactt cccaggccgc ccaccataca cacagcaaag atgtgtttaa ccattcaaac 29400 

ccatggccaa ccacatcggt tgcctcagac atgcaagttt taaaaaggaa cataactatg 294 60 

ggccaggcac ggtggttcac gtctgtaatc ccagcacttt gggaggccga ggtgggtgga 29520 

tcacctgagg tcaggagttc gagaccagcc tagacaccat ggtgaaaccc catctgtacc 29580 

aaaactacaa aaattagctg ggcgtggtgg tgggcgcctg taatcccagc tacttgggaa 2964 0 

gctgaggcag gagaatcact tgaacccggg aggcgaaggt tgcagtgagc cgagattgtg 29700 

ccactgcact ccagcctggg caacaaggga gactccatct caattaaaaa aaaaaaaaaa 29760 

aaaaaggaac ataactatgg agtctcaagg ggaagtaatt ccttcaacaa taacaaatct 29820 

tgaaagctga gctctttttt ttttttgaga caggatctcc tcactttgtc gcccaggctg 29880 

gagtgcagtg gtgggatcac agctcactgc agcctcgatc tcccaggctc aaatgatcct 29940 

cctacctcag cctcccaaga agctgggatt acaggtgcat accatcacac ccgattcatt 30000 

tttgtatact ttgaagagat ggggtctcac catgttgccc agtgtggtct tgaattcctg 30060 

gactcaggtg atctgcccgc cttggcctcc cagagtgctg ggattacagg cctgagccaa 30120 

cacccccacg ggttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30180 

cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgaaccaa 30240 

cccccccacg agttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30300 

cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgagccaa 30360 

cacccccacg ggttcatttt cagagtcaca ccgagtgctg gggttacagg cctgagccaa 30420 

cccccccacg ggttcatttt cagagtcaca ccctttttct gaaaaacaac ttgggctcat 30480 

gcaaattcga gagagagatg gtgacactcc ccgccccctg gacccaggtg gagtcgcagc 30540 

agggtttacc cgtgagcggg gtccaaggcg atggccctcg gctggtcaag gtcctgccag 30600 

aagagcacct tccgggatgt gccattgagg ttggccacct cgatgcggtt ggtctctgag 30660 

tccgtccagt acagcttctt gcccacccag tcgcaggcga ggccgtcggg agagaccagg 30720 

ccggagatga ccacgttctg cacggcggcc cccgtctggt tcaggtaggt ctgcttgatg 30780 

gcctcctcgc tcacgtctgt ccagtacacg gctcccttgg aaaactggaa gtccactgcg 30840 

gccgcatcct ccaggccgct gaccacgatg gtggactcca gcttgactcc gccggcgtcc 30900 

accagccgta cgtcccggcg gttggcaaat agcaggagcg gcgaggctgt ggggcagaag 30960 

caaaccgtga gggccactgg ctaagccagc aagatacaca gccctgggat ggagcactat 31020 

gcccagagca ctcctggtac tgccctgccc atgcccaaga cctccagttc cttcctccca 31080 

cccctaaggc gttgtcagga agttgcctgg gcagccccgg cccgcatcat tcagaggctc 31140 

ctgcagcgca gcaaacagcc ttcttcccac attcggtgac agcacctgtt tgtttaccaa 31200 

ctgttacgtc tgttccccca gatatgggtg acccttcctg ccatgcccaa aacctcccac 31260 

atcgtcctcc agaggctaca ggggccctgt cctgttctgc agagaagcca catccccttt 31320 

gttggcctga cacaggggat ggggacatgc aggcacagca ctggccatgc tgctcgctac 31380 

agacccagcc acagggccac attttttgag gggttcagag cccaggccag acagagcctc 31440 

aagattccct tacaagtctt tgaccactgt ccaagctcag gcccgtttcc ttggccgtgg 31500 

catcagcttc ccatccaccc ctgtattcca tgtttctccc accctgcttc tggacattcc 31560 

tacatttaaa gggtcactct ggaatgccac cccttggctc agacaccttc cacagctccc 31620 

tgtgccagtg ccatgcagaa caaggtcaga ccccctagcc tggcctccaa ggccttggcc 31680 

tctggcctca cctacacttc tctccaccac cccaccccaa gcattcctga tctgcctgcg 3174 0 

gccaggctgg ctccctcacc tccctgtgca ccgcagccct cagccccttc tgcctgtgca 31800 

agaagcctca tctcacagac aacggtctca ttcccacaac gggctcaatg agaaatcagg 31860 

agaggccttc agaccatcac cccaccagac acctcagacg tcggaccagg agggtccagc 31920- 

aacccccaac acagactcag agggactaag aagccacatg aggagtgaac acaagatgtg 31980 

gacaggagga ggttaagggc ctccagggag ctccatcagt ccgtgttctg ctgtcagcag 32040 

ggttaggctg ggctggccac aaacaccccc aaaaaacatc tgaagccttg gcttgaaaca 32100 

gctgacattc ctcatgaaaa ctgcagaccc ctgggtcctc ctgcgcagat gggggagccc 32160 

agccaacccc acactcccac cttcaccaag aaagagaaag ccaaaacaaa ctcaactcag 32220 

ccaatgacaa tcacagaact gaatcctgta gttagttcag ttggtttcat ttcagcaggg 32280 

gaaagatttg cagcctctat gagggtagct gggaacacaa agggccagag catggcccag 32340 

gagaccccag cgcagtgggg tagatggttc cgagcacagg cctccctgcc aagacaagca 32400 

ctggctcaaa tcctggcccc tcccattccc aggagacatg ctccacagga tgggaggaca 324 60 

cacagaggac ctgaggccag gaaaatgaca gcggcgcctc cgccgcccca cccgtgctgt 32520 
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catcatctta ggtctacagt tctttgtggc aacgagggac actgtgaaag tcaaacaaca 32580 

ggaaggcata ggccacaaat aaagacaaac gggacttcat gggaagctaa agattttgtg 32640 

catcaaaaga cactatcgag agagtaaaaa ggcaacccac agaatgagag aaaatatttc 32700 

caaatcatag atctactaag agattaatat ccatgaaata cagagaactc ctaaaactca 32760 

acaatgagaa aacaactaag ccaactcaaa aatgggcaaa caacttgaac agacatttct 32820 

ccaaagatga catataaatg gccaataaac acatcaaaac aggcttaata tatccctaat 32880 

catcagggaa atgcaaatca aaactacaat aagataccat cttgcaccaa ttaggacggc 32940 

tactatcaaa aaaacaaaat agcaagtgtt ggtgaggatc tggagcaact ggaacccttg 33000 

tgcaccactg gcaaaaatgt gaaatggtgc agctactatg gaaaacagca tggcagttcc 33060 

ccaaaaactt aaacacagaa ttaccatatg acccagcaat ttcgctttgg gttatatacc 33120 

caaaagaact gaaaacaggg acacaatcag atatgcatac accttggatc acagcagcat 33180 

ccttcccaac agctaaaaca tggaggcagc caggcatggt ggctcacgcc tgtaatccca 33240 

gcactttggg aggctgaggc gggtggatca cctgaggtca ggagttcgag accagcctgg 33300 

ccaacatggt gaaaccccgt ctctactaaa atacaaaaat tagctgggcg tagtgacggg 33360 

cacctgtaat cccagctact cacaagtctg aggcaggaga atcacttgaa ccctggaagt 33420 

ggacgttgca gtgagccaag attgcgccac tgcattccag cctgggtgac acagcgagac 33480 

tctgtctcaa aaaacagcaa aacaaaaaca aaaaaacaaa caaacatgga agcaacccaa 33540 

gcgtccctct actgagggat gaatagcggg gcaaaatctg ctccatccac acaatggagt 33600 

actattcagt ctcaaaaagg aaaaagattc tggtcaggca cggtggctca tgcctgtaat 33660 

cccagcactt ggggaggctg aggcgggtgg atcacctgaa gtcaggaatt caaggcccgc 33720 

ctggccaaga ctggcaccna gctacacana aagtatangg ccccggaaa 33769 



<210> 9 

<211> 72049 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> (8356), (8385), (38585) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 9 

tataccttgc gcggaccttc ggctcctgtg gtgaagacaa tatgaagaaa atagaaatta 60 

cccataattt tgccacacag acttagttgt gtccatgtat cttgtgcacc ttttttctgt 120 

ttacggatca aaatcgactt ttagggtcag gcgcggtggc tcacacctgt aatcccaaca 180 

ctttgggagg ctggagttgg ggttgggggg tggatcactg aagatcagga gtttgagacc 240 

agcctggcca acatggcgaa actccatctc tactaaaaat aaaagattag ccaggcgtgg 300 

tggtgggtgc ctctaatccc agctactccg gaggctgagg caggagaatc gcttgaaccc 360 

aggagacaga ggttgcagtg agccaggatc acgccactgc actccagcct ggcaacagag 420 

cgagactctg tctcaaaaaa aaaaataaaa ataaaataaa taaatacata aattgacttt 480 

taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 540 

catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgttttcct gcctgtgtca 600 

tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660 

ttgaggacat ttgggttgtc tccaggttct tggtcttgag ttttgctgtt cttgtggagc 720 

catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780 

cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 840 

ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900 

cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttaccaatt 960 

tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020 

ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080 

ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 1140 

cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200 

attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260 
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ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggtc aggagttcca 1320 

gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 1380 

gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 14 40 

gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 1500 

aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560 

ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 1620 

ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 1680 

aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740 

ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1800 

cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 18 60 

ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920 

tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980 

ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 204 0 

tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100 

gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160 

cgacaggttt gggccaccct ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 2220 

cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280 

gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340 

gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400 

ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 24 60 

cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520 

cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 2580 

gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 264 0 

cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700 

ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtccatgt ggtcggggac .2760 

agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2820 

gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2880 

ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940 

tgcatgcgtg cagggaggaa ggggagaggt cagcagcccg cctggaggag gctcgggcga 3000 

ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060 

atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 3120 

ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 3180 

agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240 

ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 3300 

cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360 

ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420 

accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 34 80 

tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 354 0 

aaacacctac agattcaccc, cagcagtgaa aggactgctt cgccacagag gtttgattta 3600 

ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660 

ctgctgacct- agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720 

cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780 

atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 384 0 

accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900 

ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960 

ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 4 020 

ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4 080 - 

tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140 

cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200 

agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260 

actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320 

agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4 380 

tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4 44 0 

tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4 500 

ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4 560 

cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 4 620 

aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 4 680 
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gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 4740 

ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800 

ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4860 

ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4920 

ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4 980 

ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040 

gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5100 

ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5160 

ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 5220 

ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 5280 

agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340 

caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400 

tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460 

tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 5520 

atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580 

aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 5640 

ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700 

ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760 

ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 5820 

ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880 

aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940 

ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000 

atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060 

gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120 

catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180 

ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240 

tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 6300 

acagcggatc cctgggccct gttgctcaga tgctctcaga gtgttgctgt ggccacggag 6360 

ggagcctgag ttaagcttct cttgtgccgg ttgtacgctg tcaggtcaca ctggtgagtt 6420 

aggcagggca cagatgccca gagcagaggg aactttcctt ggggattcaa cacgtgcaag 6480 

tcttaggggc tggcaaatcc tgccctcagc tagagagggg gcttttattt gagaccagaa 654 0 

tcacctgagc atcctcctgt ccccagctgt gtccagcctg tctgcaggga catcctgaga 6600 

ggaccaggct ctcccctcat ccacctgcct aagtgccact ctgaaccctg tccacctgtg 6660 

ccgtggaggg gcgtgacctc aagctgctca gccagcagca ggcttggccc tggggggcag 6720 

cagagaccca ggtggctgtg gggtgggtgc ttcgtggcgt ggttctgaaa cttcgttgga 6780 

agtgtgtgga cagtgccttg cctgttctct gtgggaccct atttagaaac gaggtctgag 6840 

ttactggggg tcatcactgt gttctgatgg cccagctgtg tggaggccgc ggtgcagccc 6900 

catccaagga gccagggccc tgggtctagc cgtgaccaga atgcatgccc cggaggtgtt 6960 

tctcatctcg cacctgtgtt gcctggtgtg tcaagtggtc gtgaaactct gtgttagctc 7020 

ttggtgttcc tgaaagtgcc cccgggtctc aggcctcaga accagggttt cccttcatct 7080 

cggtggcctg ggagcatctg ggcagttgag caaagagggc gattcacttg aaggatgtgt 7140 

ctggccctgc ctaggagccc cccggcacgg tgctggggcc tgaagctgcc ctcgggtggt 7200 

ggagaggagg gagcgatgaa gtggcgtcga gctgggcagg aagggtgagc ccctgcaagg 7260 

tgggcatgct ggggacgctg agcagcatgg ccagcagctg ggtctgcagc ctggtacccg 7320 

gcgggacttg tggttggggc tggtttgtgg ccaggagagg ggctggcagg agacaagggg 7380 

gactgtgagg cagctcccac ccagcagctg aagcccaatg gcctggctgt gtggctctca 7440 

gctgcgtgca taacctctca gtgcttcagt tctctcattt gtaaaatgag gaaacaaaca 7500 • 

gtgccagcct cccagaggtg tcatgaggat gaacgagtga ccatgtagca tgggctgggt 7560 

gcgtgtcacc taacatcacc agcctttgca aggagagccc tgggggcctg gctgagtatt 7 620 

tcccttgccc ggcccacccc aggcctagac ttgtgcctgc tgcaggccct tgacccctga 7 680 

ccccattgca cctgtctcca caggagccga ggaggtgctg ctgctggccc ggcggacgga 7740 

cctacggagg atctcgctgg acacgccgga cttcaccgac atcgtgctgc aggtggacga 7800 

catccggcac gccattgcca tcgactacga cccgctagag ggctatgtct actggacaga 7860 

tgacgaggtg cgggccatcc gcagggcgta cctggacggg tctggggcgc agacgctggt 7 920 

caacaccgag atcaacgacc ccgatggcat cgcggtcgac tgggtggccc gaaacctcta 7 980 

ctggaccgac acgggcacgg accgcatcga ggtgacgcgc ctcaacggca cctcccgcaa 804 0 

gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8100 
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gtaagacggg cgggggctgg ggcctggagc cagggccagg ccaagcacag gcgagaggga 8160 

gattgacctg gacctgtcat tctgggacac tgtcttgcat cagaacccgg aggagggctt 8220 

gttaaaacac cggcagctgg gccccacccc cagagcggtg attcaggagc tccagggcgg 8280 

ggctgaagac ttgggtttct aacaagcacc ccagtggtcc ggtgctgctg ctgggtccat 8340 

gcgtagaaag ccctgnaaac tggagggagc cctttgtccc cctgncttca gtttcctcat 8400 

ctgtagaatg gaacggtcca tctgggtgat ttccaggatg acagtagtga cagtaagggc 84 60 

agcctctgtg acactgacca cagtacaggc caggcctctt tttttctttt tttttttgag 8520 

atggagtctc actctgtcgc ccaggctgga gtgcagtggt gtgatctcag ctcactacaa 8580 

cctctgcctc ctgggctcaa gtgattctcc tgcctcagcc tcctgagtag ctgggattac 8640 

aggtgcctgc cactgtgctt ggctaatgtt tgtatttttg gtagagatgg ggtttcaccg 8700 

tcttggccag gctggtcgca aactcctgac ctcaggtgat ccacctgcct cagcctccca 8760 

aagtgctggg attacaggca tgagccacca cgcccggtca ggccaggcct cttttgaaca 8820 

ctttgcacac catgggtctt ttcatccagg ggggtaggta cagttgtaca gttgaggaca 8880 

ctgaagccca gagaggctca gggacttgcc cagggtcaca cagcaggatg tggcaggtgt 8940 

ggggctgggc ctggcagcgt ggctccagct ttccagcata gaaatctgtg aaagcagata 9000 

gtttgtcggt cggtagggga gactttctga gacccgcccc agcggctcag agggtagtag 9060 

ccaggggcct tcctgggggc tcataaccca gaacactgaa tgggaaaacc ctgatggagg 9120 

aggcgcagtg gagctgtggg tgccgatggg aagtcccaga ggagctggga ggtcagtagc 9180 

ggtgctgccc tctgtggagc acttagtggg caccaggtgt gtttccaggt tcatggccct 924 0 

gggacctgaa gctcagaagg tgaagtaact tgcccagggc acccgtcggg cagcggcggg 9300 

cagaggattt gtgggctgtg gagcctgtgc tcgtggccca gccctggggg ttgtgagtgt 9360 

gctggccggg gagcttttcc tgcaagtgga ctggtgtcta ggagccagca tgtcaggcag 9420 

caggcagcgg gagtgcagca ggcagcggga gcacagcagg cagagggcgg ggctcgagca 9480 

gccatccgtg gaccctgggg cacggaggca tgtgggagag ggctgctcca tggcagtggc 9540 

tgaagggctg ggttgtgccc cgaggagggt ggatgagggt aagaagtggg gtccccaggg 9600 

gctttagcaa gaggaggccc aggaactggt tgccagctac agtgaaggga acacggccct 9660 

gaggtcagga gcttggtcaa gtcactgtct acatgggcct cggtgtcctc atctgtgaaa 9720 

aaggaaggga tggggaagct gactccaagg cccctcctag ccctggtttc atgagtctga 9780 

ggatcccagg gacatgggct tggcagtctg acctgtgagg tcgtggggtc cagggagggg 9840 

caccgagctg gaagcgggag gcagaggggc tggccggctg ggtcagacac agctgaagca 9900 

gaggctgtga cttggggcct cagaaccttc acccctgagc tgccacccca ggatctgggt 9960 

tccctccttg gggggcccca gggaacaagt cacctgtcct ttgcataggg gagcccttca 10020 

gctatgtgca gaaggttctg ctctgcccct tcctccctct aggtgctcag ctcctccagc 10080 

ccactagtca gatgtgaggc tgccccagac cctgggcagg gtcatttctg tccactgacc 10140 

tttgggatgg gagatgagct cttggcccct gagagtccaa gggctggtgt ggtgaaaccc 10200 

gcacagggtg gaagtgggca tccctgtccc aggggagccc ccagggactc tggtcactgg 10260 

gcttgccgct ggcatgctca gtcctccagc acttactgac accagcatct actgacacca 10320 

acatttacaa acaccgacat tgaccgacac cgacatttac cgacactgac atttaccaac 10380 

actgtttacc aacactgaca tctactgaca ctggcatcta ccaacactga catttaccga 10440 

cactgacatt taccaacact atttaccaac actgacatct actgacattg gcatctacca 10500 

acaccaacat ttaccgacac caacatttac caacactgaa atttaccgac accgacattt 10560 

accgacaccg tttaccaaca ccgacgttta ccgacaccga catttaccga cactgatatt 10620 

taccaacact gacatctact gacgctggca tctactgaca ccgatgccag catctaccaa 10680 

caccgacatt taccaacact gacatttacc aacactgaca tttaccgaca ttgacattta 10740 
ctgacactga catctactga cactggcatc tactgacact gacgtttacc gacactagca . 10800 

tctactgaca ctgacattta ccaacaccag catctaccaa caccgacatt taccaacact 108 60 

gacatttact gacactgata tctactgaca ctggcatcta ctgacaccaa catttaccaa 10920- 

caccagcatc taccaacacc gacatttacc aacaccagca tttaccaaca ccgatgttta 10980 

ccaacgccga cgtttaccga cgccagcatc taccaacact gacatttacc gacaccgaca 11040 

tttaccgaca ctgacattta ctgacactga catctactga tactggcatc taccgacact 11100 

qatatttacc aacgccagca tctactgaca ctgatgttta ccaacaccga catttacgag 11160 

caccgacatt tactgacacc aatatttact gacatcaaca tttagccatg tgatgggggc 11220 

cggcttgggg gcaggccttg ctcttggcac tggggatgct gcagagacca gacagactca 11280 

tggggtcatg gacttctgct tcttctccag cctcatgtac tggacagact ggggagagaa 11340 

ccctaaaatc gagtgtgcca acttggatgg gcaggagcgg cgtgtgctgg tcaatgcctc 11400 

cctcgggtgg cccaacggcc tggccctgga cctgcaggag gggaagctct actggggaga 114 60 
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cgccaagaca 
gcccagccac 
tctgctattt 
gccttgagag 
aggtgcctgt 
tttgaggcca 
tgtgagattt 
tgccctgctt 
gtttggtgat 
tccctttgct 
gtggacgcag 
agcctcgccc 
cctgccgctg 
tgtggtcggg 
ggtttcagtg 
tctcttgagc 
tgttgtcgtt 
tctgcctcct 
agtgcccgcc 
gttggtcagg 
agtgttggga 
ttctcttggt 
tgacattggt 
attttcctga 
cagggctccc 
cgagtgtgtg 
accttcaggg 
ccctaggagc 
ggcaaggccc 
gtgggagcag 
ccatgtactg 
gagtgcagtg 
tcctgcctca 
tttgtgtatt 
ctcctgaggc 
cagccgtcac 
tgattttaac 
cccagtactt 
ctggccaaca 
gcaggtgcct 
gaggtggagg 
aagactctgt 
catttagtat 
ttcttctgcc 
cccagcctct 
ttttgtgtga 
tgtgttttcc 
gtgcagtggg 
gaggcaggag 
taaaagttaa 
gaggttggag 
ctgcactcca 
gttcctttct 
tctgtttatc 
gtgaacattt 
gtaggggtag 
aaactgtttt 



gacaagatcg 
cccctgcagc 
ggccacatgg 
ggtagcgcta 
agacacagct 
gcccgggcaa 
aaattatttc 
aagacagaaa 
tggttcctaa 
gggtgcagcc 
cctcgccctg 
tgccgctgtg 
tggtcgggtt 
tttcagtggc 
gcctcgtccc 
ttttttgttt 
gttgttgccc 
tgggttcatg 
accacgcctg 
ctggtctgga 
ttacaggcgt 
gccagtgata 
gcctctcctc 
gctgcatcag 
aggggaggcc 
tctgtgttgc 
tggggaaccc 
tgcaggtgtg 
atgtttgcgg 
gtcagggact 
tctcagcagt 
gcgcaatcac 
ccctctgagt 
ttttgtgggg 
acaagcgatc 
acgcagctca 
cattttaaag 
tgagaggctg 
tgatgaaatc 
gtaatcccag 
ttgcagtgat 
cttgaaaaat 
gaggacaatg 
ctgaaggaaa 
gtcaaccact 
ctggaatcct 
agattcaccc 
ttaggcctgt 
tttgagacca 
ctgaacgtgg 
gatcgcttaa 
gcctggacaa 
ttttgtggct 
gggtggtggg 
gaattcaagt 
gattgctgag 
caacagtggc 



aggtgaggct 
cagatgtacg 
aatgcttgag 
tacaacgtcc 
acttggagac 
aggggacaca 
acaatacaaa 
acatgggtgt 
gcagttgaag 
tcgccctgct 
ccgctgtggt 
gtcgggtttc 
tcagtggcct 
ctcgtcctgt 
atgggcgtgc 
gttgtttgtt 
aggctggagt 
ccattctcct 
gctaaatttt 
actcctggtc 
gagccaccgc 
tgtacatgtg 
ggggtggatg 
tccttgttga 
cgaatcaggc 
aggatttcag 
agggatgggg 
gccatttcct 
gccgctgagg 
cgcgtgtctg 
ggttttcttg 
ggctcgctgt 
agctgggact 
ggagatgggg 
cacctgcctc 
atgattttat 
tgaacagttc 
aggtgggcag 
cagtctctac 
ctactcggga 
ctgagatcat 
aaataaataa 
tggtgcaggt 
ccccatgccc 
aatctacttt 
gcaatatgtg 
acacattgtc 
aatcctaaca 
gcctggccag 
tggtggtggg 
gcccaggagg 
cagagcaaga 
ggatgacatc 
cagtggtttc 
ttttgtttga 
tcctatggta 
tgcgccgttc 



cctgtggaca 
tattggcgag 
aaaatagtta 
tgtggttacg 
tgaggtggga 
ggaatcctct 
attaagacaa 
tgttgaagcc 
gcagttttgt 
gctgtggtcg 
cgggtttcag 
agtggcctcg 
cgtcccgtgg 
ggacgcagcc 
tttggcagct 
tttgtttgat 
gcagtggcgc 
gcctcagcct 
gtatttttag 
tcacatgatc 
gcccagccct 
tccccatcgc 
cctccctctg 
cgtcaccatc 
agcttgcagg 
accctgcttc 
gaggctgagt 
gctggagctc 
atgatttgcc 
tggcacacca 
agacagggtc 
agcctcaatc 
acagacacat 
tttcgctgtg 
ggcctcccaa 
tgtggtaaaa 
aggctgggcg 
atcacctgag 
taaaaataca 
ggctgaggca 
gccactgcac 
aaaaaatttt 
atctctgcta 
accggcactc 
ctgtctctgg 
gtccctgcgt 
gcacgttatc 
ttctgggagg 
cctagcaaga 
cacttgtggt 
tcaaggctgc 
ccctgtctga 
ccattgtatg 
caccttttgt 
acacctgttg 
atgttaggtt 
tgcatcccca 



tgtttgatcc 
gcaccgatgg 
caatactttc 
taagatgtta 
ggatcgctgg 
gcactgcttt 
aaagttaatc 
agaggcagct 
ttttccatag 
ggtttcagtg 
tggcctcgtc 
tcccgtggac 
acgcagcctc 
tcgccctgcc 
ttttgctcac 
tttgtttgat 
gatctcagct 
cccacatagc 
tagacagggg 
cacctgcctc 
ctgttgagca 
accatcgtca 
tttccagcaa 
ggggtcacct 
gcagggcagg 
tgagcgggag 
ggacgccctt 
cttgtaaatg 
ttcacgcatc 
ggcctgtgac 
tcgctcgctc 
tccctgggct 
accaccacac 
gtgcccaagc 
agtgctggga 
taaacatagc 
tggtggctta 
gtcaggagtt 
aaaattagcc 
ggagaatcgc 
tccaatctgt 
aaaaagtgaa 
ctatctactt 
acgcccattc 
gggttcactt 
gtggcttctt 
agaatctcat 
ccaaggcggg 
ccccagctac 
tcccagctac 
agtgagctat 
aaaaaaaaac 
gccacagcac 
ctcctgtgaa 
tgaattattt 
tgacttactg 
ccggcagtgt 



aggaggccag 
gtgcctgtgc 
tgacaaaaac 
tcattcggcc 
agtccaagag 
tgccacttac 
acatatccac 
gctggcctga 
atgtctgttc 
gcctcgtccc 
ccgtggacgc 
gcagcctcgc 
gccctgccgc 
gctgtggtcg 
ctgtggagcc 
tgtttgtttt 
cactgaaacc 
tgggattaca 
gtttcaccat 
ggcctcccaa 
tattttgagg 
cccattgagg 
cttctgaagg 
ttgctctcct 
atggagaaca 
gagtttcagc 
cccacgaaaa 
ttttgttttt 
cccgctaccc 
aggcgttgtt 
acccaggcga 
caggtgatcc 
ccagctagtt 
tgatctcaaa 
tgacaggcat 
acaaaattga 
tgcttgtaat 
tgagaccagc 
gggcatggtg 
ttgagcccgg 
gtgacagagc 
caattcaggg 
ctagaacact 
tcccctctct 
cttctggacg 
tccatagcat 
tcctgactgg 
acgatcactt 
caaaaaattt 
ctgggaggct 
gatcgcacca 



attttgtttg 
taatgctgct 
ggatatatgt 
aggaaccatt 
gtgagggttc 



11520 
11580 

11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
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tgactttacc tcctcacaaa cgcttctttt ccatttaaaa aaatattcag ccaggtgctc 14 940 

tggctcacgc ctgtaatccc agcactttgg gaggccgtgg cgggcggatc acctgaggtc 15000 

aggagttcga gacgagcctg gccaacatgg tgtaacccca tctctaccaa aaatataaaa 15060 

attagccggg tgtggcagcg ggcgcctgta atcccagcta cttgggaggc tgaggcagga 15120 

gaatcacttg aacccgggag gcagaggttg cagtgagcca agatcgcgcc actacactcc 15180 

agcctgggtg acaagagtga aactccatct aaaataaaac aaaaataaaa ataaataaaa 15240 

atttattaaa acattcatca cagccagcct agtgggtgtc ccatgtggct ttgcctcgca 15300 

tttccctgat aactaggatg ctgagcgtct tgtcccaggc ttgccacacc tcagcacttt 15360 

gagatacgtc gcacagtccc catttgcgaa cgagaaatga ggtttaggga acagcagctg 15420 

tgtcatgtca cacagcgagc agggggtctc tgagccgtct gaccccacag ccgaccaagc 154 80 

tccaatcctt accgcctcct agtgttgtgg atgtagccca gggtgctccc acatttttca 15540 

gatgagaaca ccgaagctca aaacaggagc gttttgtcca cattggatac acgatgtctg 15600 

tggtttggtc ctgaagtcac tttatatctc agtggtccag actggagtag gacagggggt 15660 

tctggggaat ggggaaggtg tctcaggtga aaggaaggaa ttccagattc tccatactgt 15720 

ccttgggaag ttagaagact cagagggtct ggcaaagtca gacaaagcaa gagaaatgca 15780 

gtcaggagga agcggagctg tccaggaaca ggggggtcgc aggagctcac ccccaggaac 15840 

tacacttgct ggggccttcg tgtcacaatg acgtgagcac tgcgtgttga ttacccactt 15900 

tttttttttt tttgaggtgg agtctcgctc tcttgcccag tctggagtgc agtggcacga 15960 

tctcggctca ctgcaagctc tgcctcccgg gttcatgcca ttctcctgcc tcagcctccc 16020 

gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080 

agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140 

gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 16200 

ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata 16260 

gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 16320 

tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 16380 

ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440 

accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500 

tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560 

tctctactaa atatacaaaa attagccagg cgtgatggtg catacctgta atcccagctg 16620 

cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg 16680 

agatcccgcc actgcactcc agcctgggcc acagagtgag actccatctc aaaaaaaaaa 16740 

aaaaaaaaga taaaaagacc aaccgaggaa* ttgaagtggg ggggcgtcac agtagcagaa 16800 

gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta cctgacgatt 16860 

tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga gtgatggcgc 16920 

ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct cactgcaggc 16980 

cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat acattgcata 17040 

agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc cctggtggcg 17100 

cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac ctgcacgtcc 17160 

agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca cctcggatgt 17220 

gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt ttcctcctgc 17280 

aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct gcttctcagg 17340 

tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg gatgcgccag 17400 

tgttcctgct gtggaccgcc atctcacatg agggtcttgg gcctgcaggc tcgttcagga 17460 

aacacccgct gagtatgcag tgtgtgccag ctgtgtccca ggcaatggcg gggacagtgg 17520 

ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt gcaaggagcc 17580 

acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg gaggtataaa 17640 

tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg ctatgtcggg 17700- 

aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg acgtgttttg 17760 

ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg gtggtgtgcg 17820 

cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg gctagccggg 17 880 

ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg agaggacttc 17 940 

acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc tatgaaatga 18000 

ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg agcggagacg 18060 

gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt cctgggaatc 18120 

agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc cccatcctgc 18180 

gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag ccctgggcac 18240 

cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt gcttggagcc 18300 
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gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg aggtcccctg 18360 

attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt ttttccagcc 18420 

tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc cttagccctc 18480 

ttcctgagtc tagagccccg ggggttggaa gtcctggccc ctgggacacc tgcagccaca 18540 

ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc acgttcttgc 18600 

ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct cccctctgcc 18660 

ctgcccttct cccctcctac cctgcccttc tctcctctgc cccgccatgg cttttatatc 18720 

ctgtgccaca agacatggct gtgtgtgaaa gtggcagggt ctggcatctc tgtgggtctc 18780 

tgaggcccac gctccagtgc cactcttccc acccgctggc cgtgccctca tgctggaggg 18840 

acagcccagc cctctcccga accccagccc catgtgccca gctgcccccg gccctctccc 18900 

ctggaagccg gggtcactcc agccgtatgc catggtgggg acatcctgct tccttggcct 18960 

tccagggaag gtcctctttc caaatggcga cacctggtcc ctgcctggag gctggaagct 19020 

gtggcccttg tatgcccctc cagggtctgt gcgctcggtt ggcccgagtt cccatcaccg 19080 

tcatcatcac catcatcatt gtcatttcgc ttgtctgtga gccggcctgg tctcccagag 19140 

cagagaccct ctgaggtcca gcctgagttg gggtctccgt gctgacccct gacggggact 19200 

caggacgtac caggtctggg tcaggagtga cccccaaacc tcgtgccctt tgacaggcac 19260 

ccctgacttt tgctaagtgg gtggaggtga catcacttac agcgggagtg atgggacagg 19320 

gtctgttggc tgcactgtgc tcccagggat ctggggagag gctatatccc tgggctttgg 19380 

cactgcagag ctgtgtgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 19440 

gtgtgtgttt gcgtgcgcgc acatgtgtat aagatctttt tttattacat gaagcaagat 19500 

aactgttgct gtttcctttt gggttttgtg ttcaacagag tggggtactt cttccctcag 19560 

acaacagaac tctccccttt aaacacgtgc tgtcagaggg tgggtcttgg gctcatgtct 19620 

gtttgcacag ccgagtcaga ggaaacacag ggttcttcat aaaaacactg cacagcaggc 19680 

gactgtccag agtcagcctg caggacggca gcagccctgc ccctcagagc acagctaggg 19740 

tgggctgctt tgggatctcc cgtcattccc tcccagctgg cagccggcgg ccggcccatt 19800 

ccttggtgtg ctggtcaggg gggcgtgcgc ctgctctgct caccctggga atgggacaga 19860 

agctggcagc tcggagagga cagggctgga cccttgggtg gcctctggct ggaccatctc 19920 

attgtcctca gacacagcct ctcgggtcta gtttcatttc ctgaaaaaca agtgcacaga 19980 

actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 2004 0 

cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 20100 

aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaaaaaaaa attatatgga 20160 

attcaaattt cagtgtccat aaataatttc ttgagacagg gtctcgctct gtcacccagg 20220 

ctggagtgca gtgctatggc atggctcgct gtacccttga cctcccaggc tcaagcgatc 20280 

ctcctgtctc agcctcctga gtagctggga ctacgggtgt gtgccaccaa gcccggctaa 2034 0 

ttttttttta attttagtaa agacagggtc tttctatgtt gcccaggctt ttctggaact 20400 

ccatcttggc ctcccaaagt gctgggatta caggctcgag ccacggagcc cagcctgttt 204 60 

ttgttttttc actgataaag ttttgccggg tgtggtagtg tgtgcctcta gcgatttggg 20520 

aggctgaggt gggaggatcg cttaagccca ggagtttgag gctgggctca agtgatcagg 20580 

aggtgaacta tgatcatgtc attgcattcc agcctgggtg acagagcaag aacctatctc 20640 

ttaaaaatat atatttaaaa agtattgggt gtggtggctc acgcctgtgg tcccagctac 20700 

ttaggcatct gaggtgggag gatggcttga gcccaggagt ttgaggttgc agcgagccaa 20760 

gatcgtgtca ctacactcta gcctgggtga cagagcccag accctgcctc tttaaaaaaa 20820 

aaaaccaaaa aacatgtatt ggaacacagc catgcctgtt cagtcacgtg ctctccatgc 20880 

tgctttctgc tccagagacc cttatggcct gaaagctgaa aatattttct atcctttaca 20940 

aaaaagtttg ctgacctctg tcctggaaaa ttcatctccc aagttctctt ccggcactgg 21000 

cgttcctggg tgtcctaaat ttggcccctg ttatttctga actctgtttt ggctctgttc 21060 

cctcccagga gccaggacag gcacgttctc tgcatcttgt cccctgacgc ccagaggctt 21120- 

ggctcggctc aggcattctt ggaaatatct ggctccagga aaggcagagg cctcctgagt 21180 

cggcccagag ggaacctgcc ccaggtctgg gggaggcctg acccagcaga gtggcttttg 21240 

ccgatgggtt gggccggtca agatgtgctg aaagttgtcc tcagaaggcc actttgggat 21300 

tccttcctcc agtattagag caactgagag ctgctcattg caagcctgat gttttcccag 21360 

ttggccgggt ccaccgggtg ccctgggatt ctgggatctg ggtggaaagt agggggcttg 21420 

ggggagtgtc ctgggttctg gaatccaggt ggcaagtggt gaggttcagg gagtggcttc 21480 

tgagccacca taggggtctc tgtgggaggc tctgcccatc caggagattc cgcaggccct 21540 

gccggcccag agccagcgtc ttgcgcttgc cgaggctaca gccagcccca gccgggtgga 21600 

acagcccgtc gcctcctctc actttgtttt ggggccacct gggagtgtgg agcaagggta 21660 

gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 21720 
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tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 21780 

agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 21840 

gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900 

gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960 

accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020 

aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080 

gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140 

agtctgggca acaagagtga aactccgtct caaaaaaaat aaataaataa aaaataaaaa 22200 

tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 22260 

gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 22320 

aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 22380 

cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440 

ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 22500 

cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560 

attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 22620 

cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 22680 

ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 22740 

cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800 

gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22860 

ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22920 

gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 22980 

actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 23040 

ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 23100 

tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 23160 

tcaccctgtc acccaggctg gagtgcaatg gcatgatctc ggctcactgc aatcttcgct 23220 

agctgcgttc cagcgattct cctgcctcag cctcccgagc agctgggatt acaggtgtgt 23280 

gccgccatgc ccagctcgtt tttttttgta cttttagtag agatagggtt tcaccatgtt 23340 

ggccaggctg atctcgaact cctgacctca aatgatccgc ctgcctcggc ctcccaaagt 23400 

gctaggatta caggctggga tcacactgtg cctggcccta gcagctttgt cctgtgccat 234 60 

ccaacaacag atgaccgaag tctttgtttc ttaacatgca ttccatctgc cttacagttt 23520 

tgccacctgc aaaacagagg acttgtcgct tttctggtaa gctggaaatg taatctggta 23580 

gcaggaggcc tgtggaagct tgcctttaat ggccttgtgt ctctttcatc ctgtcctgag 23640 

agccggagaa cttggatgtt gcacctaact caaccttcct gttaacatac agttctgcag 23700 
gctcatggat catcagaacc acgtcctatc tcacgcggct gtatgcttcc gttggttcag . 237 60 

gtgtttttac cttgacagta ttttctcctc ggtggctttt gcggtggttg cttttaatca 23820 

gcattgactc ttcaagaaaa atatttagct gctacatctc agaggagaca gggtggaaag 23880 

catctgagac ctgcaggctc agacttagaa ccagaagtgc cctcagagtt catccggccc 23940 

tgacccagcg ggaaatgagt tcacagagaa gcgggagaac tttgccccag gccctgccgt 24000 

tgctcataac tgccccaggt ccttacattt gctccaggtc ctgccccagg ccctgcagtt 24060 

gctcataact gccccaggtc cttatatttg ctccaggtcc tgccccaggt cctgcagttg 24120 

ctctgtgtgg tgggtgtgat ctggagccct ccgcccattg ctgcacctgg ggcaggcatt 24180 

gctaattgat cccaggactc cttcctgcgg agcacgccct ggttctccag gcagccgctg 24240 

cctgtcagcc tgcagtggtt cgggagagga cacctgcttg cctggtctgt tccaaatctt 24300 

gcttctcatc ccagcacagg tagggggtgc tatgggaaag ggatcctcag ttggccctgt 24 360 

cactgctcta tcagctgggg acgtggcatc ctagtgaaaa catcatggcc gggcgcggtg 24420 

gctcacgcct ggaatcccag cactttggga ggctgaggag ggtggatcac ttgaggtcag 24480 

aagttcgaga ccagcctggt caacatggtg aaacccatct ctactaaaaa tacaaaaatt 24540* 

cgccaggtgt ggtggcgggt acctgtaatc cgagctactc gggaggctga ggcaggagaa 24 600 

tcgcttgaac ctgggaggtg gagcttgcag tgagccgaga tcttgccact gcactccagc 24 660 

ctgggcaaca gagtgagacg ctgtctcaaa atctcaaaca aacaaacaaa caaaaaacaa 24720 

acaaacaaag cgtcatttat ccagcacccc tggggaacca tgctacctgg tgttttatgg 24780 

tacctggcaa ggtgcaggtg aagttgctgc tcttgggcat tgaacccgtc ttgtttgggg 24840 

cagctcaggc cccaggcagg gtccgggttg gctctcgttg gtgtggccct ggcccatcca 24 900 

gacctatatt tctgccgtcc tgcaggtgat caatgttgat gggacgaaga ggcggaccct 24 960 

cctggaggac aagctcccgc acattttcgg gttcacgctg ctgggggact tcatctactg 25020 

gactgactgg cagcgccgca gcatcgagcg ggtgcacaag gtcaaggcca gccgggacgt 25080 

catcattgac cagctgcccg acctgatggg gctcaaagct gtgaatgtgg ccaaggtcgt 25140 
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cggtgagtcc ggggggtccc aagccatggc tcagccatgc agacttgcat gaggaggaag 25200 

tgacgggtcc atgcctgggc ataagtgttg agctcaggtg ccccgacctg gggaagggca 25260 

ggacaggaaa ggtgacagta tctggccaag gacagatggg aagggaccaa gggagctgat 25320 

tagggagtgg ttatggacta ggaatgtcgg taacaatggt tagaaagtga ctaacatttg 25380 

ttgagcacct gctgtgtgcc cggccctggc cgggagcctt cgtgcccaca gtgaccccgt 25440 

ctgcaaatgt agttccttgc cctactcgca ctggggagca ggacgcagag ccgtgcaact 25500 

cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 25560 

ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 25620 

gaggaggtcg ttgtctggca caatgatttg tctcttcctg aaaaggtgac agagttacac 25680 

tggagagagc agcatccagg tgcggcaggg a^aggcctgg ggctcgcggg cagggactct 2574 0 

gtgtcctgcc ggggtcccac actgcacctg cttgtcagag gcactcagtc aatctttgct 25800 

gatgaaggat gagaggacag aggacgtgat gcttgctgct gcattgcctg cagtcctggg 25860 

tgagatgccc gggttgactc tgctgcccgt cgggtggatg tgatgtcaga tccccggctt 25920 

taaaatacga gggagctggg aattgaggga gcaggttggg gcagaaagca cagccccgtg 25980 

gaagcctgga gctgaggcag tgtgggcgac ccctggagca gtgagtgctt ccttcatggc 26040 

cttcatcgca ccctgcagtc ctcatgtagg ggatgccatc catgaattta gttttcccag 26100 

cctcctttaa aaacgcgttc atgctggggc cggggcagtg cagtggctca catctgaaat 26160 

cccaccactt tgggaggccg aggcgggtgg atcatgaggt caggagatcg agaccatcct 26220 

ggctaacaag gtgaaacccc gtctctacta aaaatacaaa aaattagccg ggtgcggtgg 26280 

cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 26340 

aagcggagct tgcagtgagc cgagattgcg ccactgcagt ccgcagtccg gcctgggcga 26400 

cagagcgaga ctccgtctca aaaaaaaaaa aaaaagtaca aaaaaaaaaa aattagtctg 264 60 

ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 26520 

aacccaggag gtagaggttg tagtgagccc gtatcgtacc actgccctcc acctgggcaa 26580 

tagagcgaga ctctgtctca aaaagaaaaa aaaaaaaaga acatttatgc caggtgtggt 26640 

ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 26700 

gaaatttgag agtgtcttcc ctgggcaaca tagagagacc tcatctctac cagaaaaaaa 26760 

aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 26820 

gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 26880 

ccagcctggg tgacagacag agaccctgtc tcaaaaaaaa aaaaaaaaaa aagcatttac 26940 

tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 27000 

cccagagata agacaaaagg gtgcctccat gggggtgtga tttaaagctg agaaattggg 27060 

cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 27120 

tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg 27180 

agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 27240 

agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 27300 

gaggaagggc ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 27360 

cctgccctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 27420 

cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 27480 

gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 27540 

aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 27600 

aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 27660 

tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 27720 

cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 27780 

cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840 

gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900 

ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 27960* 

cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 28020 

acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080 

acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 28140 

tccctcgaga ccaataacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 28200 

gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 28260 

gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 28320 

ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 28380 

ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 28440 

ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 28500 

gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560 
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ctctttcttc 

tttttttttt 

tcagctcact 
gtagctggga 
gagacagggt 
acggggttgg 
aaaacttttt 
tctgtaaaga 
tttgggaggc 
cgtggagaaa 
cctgcctgta 
gcggaggttg 
aactccgtct 
ttgtctgtgg 
agcctaaaac 
gtattggaca 
ttgttgttgt 
gtaatttcag 
tccctagtag 
tttttttttt 
ctcaaacacc 
agcctattct 
agggttcttt 
ccataacctc 
ttttttctct 
cctgattttt 
cacaatcttt 
gttcttttat 
aacactttga 
ggcaacatag 
gcatagctgt 
aggttgatgc 
agacttgtct 
tatacataca 
atataaaaat 
tatataatat 
tatttataaa 
tattttttaa 
taaaatatat 
acattcattt 
ccagtggcca 
ggctcacgcc 
ggatttcgag 
aattagccag 
agaatcgctt 
caggctgggc 
aaaatgtggc 
ggagctcggt 
actggatggg 
ggctggacgg 
tggccctgga 
tcgtatgaga 
cagtgattag 
tcagaggtca 
gctgctttca 
tgtgtcgatt 
acatacccag 



atttatagtg 
tgagacagag 
gcaacctccg 
ctacagacat 
ttcatcatgt 
cctcccaaag 
gccttctaaa 
aagtttgatt 
cgaggtgggc 
ccccgtctct 
atcgcagcta 
tggtgagctg 
cagaaaaaaa 
cagcttttgt 
atgtgccatc 
actacaatgt 
tgttgttgag 
ctcactgcag 
ctgggactac 
gtggagacag 
cacctcagcc 
actgtttgta 
gtagaaaaaa 
tccagccaga 
aagatagcat 
tcacttgaca 
taaatgtatg 
tttgcgtgtg 
gagggtgagg 
tgagacccca 
agtcccagcc 
tgcagtgacc 
ggggaaaaaa 
cgcacacaca 
atatatttat 
aaaaatatat 
taatatataa 
tgtatgatat 
ataaaatctc 
aaaaacttga 
agcatctcgg 
tgtaatctca 
accagccttg 
gcgtggtggc 
gaacccaggg 
aagaagagcg 
ccttttcctc 
ggagcacgtg 
caagaacctc 
gcagttccgg 
tcccaccaag 
cagtgcgggg 
agctgtactg 
tggagttcct 
cctgggccgt 
ttacagatgc 
ttagagcaga 



caaatgtaaa 
tcttactctg 
cctcccaggt 
gcaccaccac 
tggccaggct 
ggcggaaata 
gaagagttta 
ggtggctggg 
agatcacctg 
actaaaaata 
ctcaggaggc 
agatggcacc 
aaagtttgat 
tctgccgaga 
cgccccttta 
ttgctcattt 
atagggtttc 
cctcgacctc 
aggcacacac 
ggtttccgca 
tcccaaagtg 
ttacatagct 
atatatatat 
gacgaccgtt 
ttctcttcat 
ttacaacagg 
aaatactcca 
ctggttacag 
caggaggatc 
tctctacaaa 
accaaggagg 
tgagattact 
aaaaaaaaaa 
cataatataa 
aaataaaatt 
attatataat 
tacatactta 
ataatataca 
caagttgctt 
atgctggtga 
gagatgtcta 
gcactttggg 
ccagcttggt 
atgtgcctgt 
ggcagaggtt 
aaactccgtc 
ctcacctgct 
gtggagtttg 
tactgggccg 
caagtcctcg 
gggtaagtgt 
gtgccaactg 
atgtcattag 
tcgtggagcg 
ctcaccgaag 
agaaacaggc 
gctgggccag 



catccaggac 
ttgcccaggc 
tcaagcgatt 
acccagctaa 
ggtcttgaac 
acaggggtga 
gcaaactagt 
tgcggtggct 
aggtcgggag 
caaaaaaaaa 
tgaagcagga 
attgcactcc 
tggtgtaacc 
tgagttgtga 
cagaaaaagt 
attattctat 
cctctgtcac 
ctgggctcta 
caccactcct 
tgttgcccag 
ctgggattac 
ttaaaagatt 
aggaaagtat 
gctgacacct 
cacagtcata 
tatttgatgg 
ctgcacagat 
ccgggcacag 
acttgagccc 
aaactttttt 
ctgagttggg 
ccactgtact 
tatatatata 
aaatatatat 
tataaattat 
atataataaa 
taagtatata 
tttataaata 
tttccaaaaa 
tctggtccag 
caaaacacgc 
aggctgaggc 
gaaaccccat 
aatcccacct 
gcagtgagcc 
tcaaaaaaaa 
gccagaccat 
gccttgacta 
acactgggac 
tgtggaggga 
ttgcctgtcc 
ggcaaggtgg 
ccttgatggt 
ggtgctgagg 
tgtccatgga 
tcagagaaac 
gaagtgctgt 



aaatacagga 
tggagtaccg 
cttctgcctc 
ttttttttat 
tcctgacctc 
gccaccgttc 
ctgtgggctg 
cacacctgta 
ttcgagacca 
attaaccggg 
gaattgcttg 
agcctgggca 
aaagcgcatt 
cagatctgta 
gtgctgacct 
gatttgtttt 
tcaggctgga 
gtgatcctct 
ggctgatttt 
gctggtttca 
aggcgtgagc 
ttttatgact 
aaaaagaaag 
cagcatattg 
tgctacgcag 
cgctgtgaca 
gtttgctttt 
tggctcatgc 
agaagtttga 
aataagtcgg 
aggattgctt 
ccaacctgag 
tatatatata 
ttataaatat 
atttataagt 
atatataata 
tttaaaatat 
cacatttata 
ggtgtcttgc 
aatgtgttca 
tggttctggc 
aggtggatca 
ctctactaat 
acttgggagg 
gagatcgcac 
aaaaaaagat 
cagccgcgcc 
ccccgagggc 
caacagaatc 
cttggacaac 
cgtgcgtcct 
caggctgtcc 
ggccaggact 
ctgtatcagg 
gcctgcgtag 
cgagtgactt 
ctcaggctcc 



agactttttt 
tagcgtgagc 
agcctcctga 
atttttagta 
aggggaacag 
ccggcctagg 
gccttctgat 
atcccagcac 
gcctcaccaa 
catggcggcg 
aacctgggag 
acaaaagtga 
tgtttatgga 
tgggctctaa 
ctgttctaaa 
ctgctttttg 
gtgcagtggt 
catctcagcc 
tttttttttt 
aactcctagg 
caccatgccc 
ttaagtcaca 
taaaaattgt 
cctttaagtc 
aattctgtat 
aactctttgg 
aggcttaact 
ctgtaatcac 
gaccggcctg 
gcgtagtggt 
gagccccagg 
cgacagagca 
tatatacata 
ataatatata 
aaatatataa 
taaaaatata 
atgtaatgta 
ttattttata 
tgcatttcaa 
gtagctgctg 
ctggcgtggt 
actgaggtct 
aatacaaaaa 
ctaaggctgg 
cattgcactc 
gctggttcct 
ttcatgaacg 
atggccgttg 
gaagtggcgc 
ccgaggtcgc 
tgtgttcacc 
gtgtggccct 
ggtagggccc 
cacagtgctg 
ggtgggtatc 
ccctaaggtc 
tgaccaggtc 
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tccttgcttt gcactcttgc caaaaccatg atccagaact gactttgagg tccccggacc 32040 

tcaggctcct ccgaaatggc ctcttggagg ctgctgagcc acagcttagg acccacctcg 32100 

agaggcaaat gtgctttgag ctgccaggcg tcctgggggc cctgccttgg gcacggggtt 32160 

cagacaggcc ccagatgtgt ggggcgtctt tctggacttg agttttcttt tctgtgtggt 32220 

ggacacagtg ctcacccctt aaagcacctg tgatgtgtgc agcagcccaa tccctgcctg 32280 

tcgcctgttc tgctagggaa ggaaggaata cttcaggatg gcaggacaac agaaagaggt 32340 

ccaggtttta gagcaagggc aggtcaaact tagaaaattc tggaatgagg atgtgcattt 32400 

cctcttctgg atctgctaaa agaagaggga aggaggggct gctgggggag gagcccagag 324 60 

ccgagtttac atccggatcc cgcaaggcct cccctgccct gaggtcttgt tttgtgatgt 32520 

gcttgtgtcc atcctggttt ctgccgtgtc cccaacatcc ggccaagctt aggtggatgt 32580 

tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 32640 

caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 32700 

ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 32760 

ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 32820 

ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 32880 

catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 32 940 

caactgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 33000 

tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 33060 

gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 33120 

cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 33180 

tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 33240 

cccctgggtg cctttgctgc agttatgtcg ggagaggctc tggtgacagc tgtttcctgt 33300 

gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360 

cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420 

tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 33480 

ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33540 

ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600 

gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660 

accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 33720 

ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 33780 

caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 33840 

gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 33900 

agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 33960 

cctgtagtcc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34020 

aggttgtggt gagctgtgat catgccactg cactccagcc tgggcaatag agtgagaccg 34080 

tctccaaaaa aaaaaaaaga agaagaaaaa gaagctctga ggctccaagt ccccaggcac 34140 

cccttggctt gagggcagac aagggaggag agggtcacct gggcagccct gacttttgtc 34 200 

ccctggcaaa gggaccttca gtgaccttgg ccctaggaga gcctctgagc acgtcagcca 34 260 

tgtcgaaccg ctcaggaagg gcagcaagaa tttggcttct gacctctgcc tctcctactc 34 320 

gccatctgca ctgggtgtgg ttgtgcccat tttacagatg aggaggctgg ggcatcgacc 34 380 

agctgaatgc cttgtcccag gtactgcgta ggcagagctg gcagttgaac cccgtgtcct 34 4 40 

ggttgtcgct gggggtgggc tgcaccctga cttgtgaggc cagtagcaag gtttgcacgt 34500 

gacttcgtga ccgtcaccca gctctgcagc acatcccgtg acccagctca tccaggccgc 34560 

atgcaaacct gttgccaggc gagaaaccag tcaccgcaca gctgtggttg cctgaaatga 34 620 

ttaagctcat taatcacccc ggagtgagga cagactcaga tgaaaaccag caaaagccct 34 680 

ggaaactcat gtgaccctgc caatgagggc ggccatgtgc attgcagcct ggccgtcact 34740 

cctcggtacg tgttttggac ttaaacgctc cggatgttta ctgagtgctt gattaataac 34800- 

atggaaggcc tggtctcatt gctgtgggag tgaaggatgc acagccaggc ctgacatgat 34 8 60 

gagaacaaga acctggagtc tcgctgcctg ggtggtaatc ctggccctgc cacttagcaa 34 920 

ctgtgtgact gtagccaggt cacttaattt tgctagatcc tgcctgcgct tcagtggatc 34 980 

ttgctggttt tccaaggtgg ccaaacactt taaggcattc atgtggtcgc taggctgcag 35040 

ggttgaaccc tggctcaccc cgcagggcgc cgtgtgctct gtggcctggc tgtgcctttg 35100 

ctgacaccgt gcccgtgtgt gttcatgcag gtcaggagcg ggtcgtgatt gccgacgatc 35160 

tcccgcaccc gttcggtctg acgcagtaca gcgattatat ctactggaca gactggaatc 35220 

tgcacagcat tgagcgggcc gacaagacta gcggccggaa ccgcaccctc atccagggcc 35280 

acctggactt cgtgatggac atcctggtgt tccactcctc ccgccaggat ggcctcaatg 35340 

actgtatgca caacaacggg cagtgtgggc agctgtgcct tgccatcccc ggcggccacc "35400 
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gctgcggctg 
gcctcatggt 
ctgaaaggag 
actgcacaag 
ttcatcagga 
ttgggtgtga 
agtgctttcc 
catttcacgt 
aactggggtg 
gaacccttgc 
aagtgggcct 
ggagtacagg 
atgattccct 
agtctcacgc 
gggaacttga 
gcctgggcca 
tccctccctg 
gaggggttgc 
tggatttcag 
aagccaggca 
atgttctggt 
ggcccagctt 
ggagtgtctt 
cataggcaca 
ccagtgttca 
tattgggctg 
gcagatcacc 
ctactaaaaa 
ggaggctgag 
agcaccactg 
taaataaaag 
gcagagacac 
ctggtccccc 
gagtggcatg 
ctgtctttgc 
agtcggatga 
aggaacgtca 
cgccagaaca 
tgcggggtgt 
tagcagaggg 
tagtcacagg 
ctgggtttcc 
tttcaagaaa 
ttatccttta 
ctgaaagaca 
gccagggcac 
aaatagcaaa 
cgtaaacgtt 
tatgagtgtg 
gaaatcaaat 
tgagcccagg 
gccggctgta 
aatgtgaaac 
ctttnttttt 
ctggagtgca 
tttcctgcct 
agatttgcat 



cgcctcacac 
cccccgcacc 
cttctcatct 
ctgcatgatg 
catagataaa 
aaaccagtca 
atagcgctcg 
acaaggaacc 
ggttgagccg 
caggcagcct 
ctttgttctc 
tgtttcctgt 
acgcagaact 
agacctggtc 
ggcttgcaaa 
gagtctggga 
gggccatgac 
catgactcag 
accagpcact 
ggactcaggc 
tctgctagag 
gattctgggg 
ctctcgggct 
tagcaaaccg 
ctaccattac 
gacgcggtgg 
tgaggtcagg 
tacaaaatta 
gcaggagaat 
cagtccagcg 
aaaaataaat 
atatgacacg 
cgtctttccc 
ctatcccgtc 
ctcctctagc 
tcccggacga 
aagccatcga 
tcaagcgagc 
gcttcccaag 
aggaaacaga 
gtatgaccct 
ccgttgttta 
gaaaatggtc 
ttttgtggga 
cctatgcaaa 
agccctcaaa 
gtgccttttg 
aatgctgcaa 
gaagggtgta 
ggaaagttct 
gaggtcagag 
gttttttgca 
cgtgctgtgc 
tttttttttt 
gtggcatgat 
cagcctcctg 
ttttgaaaca 



tacaccctgg 
tcactccctc 
ggggttcctg 
ctacctgggg 
tggccaaaac 
caggggaagg 
tttactcctc 
cgatgcacag 
gggttgactg 
ggcatcagct 
atgtcagtga 
tggcgggctc 
tgatagttta 
gcaggcgggg 
ggttaagggg 
cttcccatgc 
cttagagagc 
agtccccgtc 
gtagcccgct 
caggagactc 
agctggttct 
ctgccttgtg 
tgttgactgt 
cacacacagt 
ctgccttccg 
ctcatgactg 
agttcaagac 
gctgggcatg 
ggcgtgaacc 
tgggcaacag 
ttatgatcta 
gagaatgaaa 
gtggacctcc 
ctccagctcc 
gcccaccacc 
ccagcacagc 
ctatgaccca 
caaggacgac 
gcgctcctct 
ggatggctct 
gagatgcgtg 
agtagtcatg 
gtgtgtcatg 
actgtacagt 
atcgcacagt 
atctccatca 
cacattaaat 
taaatacggc 
gcgcatgagt 
caccccaggc 
acggaggtgt 
ttcgcttaat 
tcaaattgtc 
tttttttttt 
cttggctcac 
agtaactggg 
actgctaggc 



accccagcag 
gttagatcag 
ggtgtacata 
gtccaggtcc 
tcctcagctg 
gtgcttgctc 
aagcctggag 
agaggtgtgg 
tggcaggctg 
cggggaattt 
tgagaagagt 
ttcccccatg 
taaagctctt 
ctggtcttgc 
ctgttcgagg 
ctgggctgtc 
caaatggagg 
ctgtggccgg 
gacggtgcgc 
tgttagctgt 
tcggatcctg 
gtggcatgtg 
gcccggtttt 
cgtgggtatg 
tcttaagtgt 
ttatcccagc 
cagcctggcc 
gtggggcatg 
cgagaggcag 
tgcgagactc 
tttcaaaaat 
ttccccatag 
agcggggcag 
tctgtggctt 
ttcttgctgt 
ccggatctca 
ctggacaagt 
gggacccagg 
tgctggtttc 
gggtgaatga 
gaaccccgag 
gtggggtcag 
caggggtagt 
catttctgct 
aaaaatgaca 
atgacatgta 
ggttagctat 
acttcacctt 
ttttgtgaaa 
gtggagaaga 
gtgtgtgggt 
gtttctcgtg 
ctaatacatc 
tttgaaatgg 
tgcaaccttt 
attacagggc 
tgggcgcggt 



ccgcaactgc 
gctggttctg 
gatggttggg 
aggctggatg 
gaaggtcctg 
atactgccag 
ggtggggagt 
caacccatcc 
gctcaagagt 
ttgccctgac 
gactttccta 
acatcagcag 
tgtcatccag 
ctgtcccagc 
cccacgctgg 
tttggtcctg 
tgcaggtaac 
cagtacctgg 
tcgaagtgcc 
tgagagggag 
gtaccagtgc 
ctgctcactg 
ccgcagttca 
agtttcacta 
tcatcattta 
actttgggag 
aatatggtga 
cctataatcc 
agcttacagt 
catctcaaaa 
aacacatgta 
cgcaccccca 
atgctgagcc 
acagacaccc 
tcagccagaa 
tcctgcccct 
tcatctactg 
caggtgccct 
caggctgctg 
tgacttgggc 
actgtgatta 
accccacagg 
tggtactggt 
acaatgctgt 
caactcatag 
agaaaagaga 
atcccacaat 
gggaagatct 
cactggaagg 
gtgggtcatg 
gtgaccctgc 
gaggaaattg 
attgcattgg 
agtctcactc 
gcctcctatg 
atgagccacc 
ggctcacacc 



agccgtaagt 

ggagctgacg 

taggttgtgc 

gacttgttgc 

ggcaggatct 

cacagtgctg 

agcatggtcc 

aaggccatac 

ccctgctcct 

ccttggaagc 

tggcccctct 

cgagctggtt 

gccccgttgg 

tgcatggatg 

caggagatgg 

ttgctcacca 

ccacggcaag 

tgcaacgact 

acagcttctg 

aggccaacgg 

actgagagga 

acaccctcga 

ctggtgcaca 

cattccacca 

aaaataaatt 

gctgaggcgg 

aactccatct 

cagctactca 

gagcccagat 

aaaaaataaa 

ctttgaaaca 

agagacagcc 

gcctgttgtc 

acctgcagcc 

atctgccatc 

gcatggactg 

ggtggatggg 

gtgggaaggg 

cccctgtcct 

ttcgattatg 

tatgtagaaa 

acttttgtct 

taatccaggt 

atatgctctt 

ggaaagcggg 

ggaacctggg 

actgtgcatt 

ggagttggct 

aggattgtgg 

gccccagcag 

gcagttccct 

tgcatgagca 

aacagattgg 

tgtcaccagc 

ttcaagtgat 

gcggccggcc 

tgtaatccca 



35460 
35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
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gcactgtggg aggccgaggc aggtggatca cctgaggtca ggggttcgag accagcctgg 38880 

ccaacatggt gaaaccccgt ctctactgaa tatacaaaaa tcagctgggt gtggtggcgg 38940 

gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccaggagg 39000 

cagaggttgc ggtgagccga gatcacacca ttgcactcca gcctgggcaa caagagcaaa 39060 

actccatctc aaaaaataaa aaatagaaaa acaagtgctg tagcggaagt gagcactttg 39120 

cggagtcagg cttgtgtggc ctgttccaca aatgatgtgc tcacggtggc ctcaggccca 39180 

cctggagtct gcagcatggg gcacaacagg ttcattagtg tagaattcca ggacaggcct 39240 

ggctcctaag cagccttctt ttacaaaaac tgcagagccc gcctgtatcg tagcactttg 39300 

ggaggccgaa gtgggtggat cacgaggtca ggagttcaag accagcctgg ccaacatggt 39360 

gaaaccccat ctctactaaa tatacgaaaa ttagctgggt gtggtggcac gcgcctgtag 39420 

tcccagctac tcgggaggct gaggcagaat tgcttgaacc tgggaggtgg aggttgcagg 394 80 

gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 39540 

aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 39600 

tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 39660 

gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 39720 

gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780 

tttctagtta cgttgtactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840 

ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900 

gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960 

ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 40020 

tgagaaagct cctgggcgta cattgcctta gagtttcatt ggagctgcgt tcttttatgg 40080 

tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 40140 

ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200 

cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260 
tggtgtctaa attatcatcc aattggccga ggttttgttt tctattaatt gtttttattt - 40320 

tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 40380 

tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 40440 

cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500 

gggactacag gcatacacta ccacatctgg ctgatttttt gtattttttt tttattgtag 40560 

agacccgcta tgttgcccag gctggtctca actcctggac tcaagccatc ctcccacctc 40620 

accctcccaa agtgctggga ttacaggcat gagccacaac acccagccat tttaattttt 40680 

tttttttttt ttgagatgga gtctcactct atcgcccagg ctggagtgca gtggcgtggt 40740 

atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc 40800 

ccgagtagct gggattacag gtgcccatca ctatgcctgg ctaatttttg tattttttag 40860 

cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg 40920 

cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt 40980 

tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt 41040 

ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg 41100 

tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag 41160 

cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca 41220 

gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa 41280 

ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340 

gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400 

gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 414 60 

cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520 

tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 41580 

tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 41640* 

accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700 

tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760 

cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820 

a.gagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880 

cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940 

gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000 

tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42060 

attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120 

ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180 
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gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240 

agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 42300 

gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360 

cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420 

tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 42480 

cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 4254 0 

aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600 

ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 42660 

cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 42720 

tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 42780 

cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 42840 

gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 42900 

ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 42960 

gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 43020 

ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 43080 

• gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 43140 

cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 43200 

tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 43260 

ttgatacaga tggagtcttg ctatgttgcc cagactagtc tcaaactcct gaactcaagt 4 3320 

gattctccca cctcagtttc ccgacattct gggatcacag gtgtgagcca ctgctgtctc 43380 

cctgttttat taactgctga aagacctaga taaagaaagt ctgaaaagac ttactatcag 43440 

agcaccatcc taagatgatt ccctctgact caatggagag ggaggggagc ttttccttca 43500 

ggcctgggtg gcaggagccc aggtgctcca ggccccattt gccccaggcc aaatcactcg 43560 

ggaacttgga tgcagctgtc tttcagggta acccaaagga accagatccc cgcaggcagt 43620 

aggcttctgg gctgtcctct cctcctacgt cagctcagta agagcccttc gaagggatgc 43680 

tgtgtcggag gccccaaaag cccaggctca tccctgagat gcacagggtg ggctgggctt 4 3740 

aggcagcgct cgagcatctc ctggacggtg accccagaga gtgtggagac ggagagtcct 43800 

tgagagtcac tgagagacgt ggctgccctg ccttcccaag aggggctctg agtcattccc 4 3860 

cacactcacc tgcccctacc caccctcacc tggcccccag cctcacctac ccccacatct 43920 

gtaccgatcc ctttacccgc accttcccta cccaccctca cctcccctgt accttcacct 43980 

cccccactca cccgcccctg caccctcacc tgtcccccac cttcacctaa cccccaccct 44040 

cacctgccct cccctcacct ggcctccttc cgttggggaa ggggttgtaa ggggcggccc 4 4100 

ccaaactgtc tgtcctggtg ccctgcagag aaaacagtac gtgagggccg cagtccaaaa 44160 

gcttgagtcc tggaaggtgg aggagacagg gatgtgttgg gaagggcccc atggtcttgg 44220 

atcccttctc gactgtcaat ggggccttca tgggagcgcc agtctagtga tgcacagctg 4 4280 

ggtgcccggc gggtggctga ggaggcctaa agtccgaggc ggcaagagct cttccagagg 44340 

ctgttgtcct aatcgctctg gcatactcag gcgggcacgt agttaggagc tgattggaga 44 400 

ggagagaccc ccacaccaat actgggattt gactttcagg ctaaacttga gaagtgtggc 44 4 60 

ctctgctgtc ctgccagagc tctccagcca gtgcccaggg ctctccagcc agtgcccggg 44520 

ggtctccacc agtgcccggg ggtctccgcc agtgccaggg gtctccgcca gtgcccaggg 44580 

gtctccgcca gtgctcagga gtcttggttt ctttgtctta cagccctttg ttttgacctc 44640 

tctgagccaa ggccaaaacc cagacaggca gccccacgac ctcagcatcg acatctacag 44700 

ccggacactg ttctggacgt gcgaggccac caataccatc aacgtccaca ggctgagcgg 44760 

ggaagccatg ggggtggtgc tgcgtgggga ccgcgacaag cccagggcca tcgtcgtcaa 4 4820 

cgcggagcga gggtaggagg ccaacgggtg ggtgggggtg ctgcccgtcc aggcgtgccc 4 4880 

gccgtgtctt ctgccgaatg ccagcctctc acaggctggg gagactttcc accctgggga 44940 

tccaatgggt ggctttccag ggtcccaaaa gcaaacacag gctctttcac agcccctcca 45000- 

ggaaagcaga aagccccaag ggctggaagg gaagggggag ctctgctgag aggttacaag 45060 

gcagcgctgg ccgacgggag ttgcagttga taggttttgt atcatccttg ttaaacttga 45120 

accctgtgca gaaatccctt ccacggcatg ggggctgcct gttgactcgc tcctgttcca 45180 

ccacagggag ctcctgggct tcttcctccc agaggccccc gacgctccca cctgttggtc 45240 

gtcagagctt ctggttggtg ggaaggcacc caggaccttg aggtctccag agagaaaagc 45300 

cagggaaaga gggagaccga aacccatgtg acatgaaact caggctccaa actgagcacg 45360 

ggaacgtttg gggacaggag cgcgatggcc ttcctcagat agctgggggg ctggcatgaa 45420 

gacgggagct acagccagca caggtcctgg gccgggagcc cagagattga gccctgactc 45480 

tgtcacttac tggccacgtg accttgggcg ggtggcatag cctcttggag actcagtttc 45540 

ctcattggta ggagtgacgg ccacagtggt gcggcctctg cagcacacgg ggggctcggt 45600 
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gggcggaagc cccgggtcta taaggcggct gtgcaggagc cagccgagct ggtctcccaa 45660 

cagccagggc tccggggtcc ttagcagctg tggggggcct gcacctgttt cccatggctg 45720 

ctgtcagaaa ttaccagaag ccaggtggct gagagtaatg gacacttgtt ctctcacagt 45780 

tcctgagggc tgaagcccga gatcgaggtg tgggcagggc cctgcgccct ctgaaggctc 45840 

tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900 

ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960 

agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt 46020 

gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 4 6080 

gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 4 614 0 

aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 4 6200 

ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 4 6260 

ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 4 6320 

caccaccggc ctcatccgcc ctgtggccct ggtggtggac aacacactgg gcaagctgtt 4 6380 

ctgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 4 6440 

ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500 

tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 4 6560 

tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 4 6620 

aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 4 6680 

actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 4 674 0 

ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 46800 

catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860 

ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 4 6920 

gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980 

tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 4704 0 

tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 4 7220 

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280 

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 47340 

tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400 

atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 474 60 

gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gttttctttt gccgcagaac 47520 

aaatcaccac gaacagcgtt ttaagacaac accaactctt tttttttttt ttttttttga 47580 

gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47 64 0 

gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700 

ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760 

gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa 47820 

, agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47880 

tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 47 94 0 

ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48000 

agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060 

ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48120 

ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct 48180 

gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240 

cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 483O0 

tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360 

ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420- 

gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480 

gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 4854 0 

cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 48 600 

aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 48660 

caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 48720 

tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780 

actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 48840 

tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900 

gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960 

cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020 
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ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 4 9080 

attttatttt attttatttt attttatttt atttatttat ttatttattt atttatttat 49140 

tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 49200 

tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 49260 

attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 4 9320 

ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 49380 

cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatttt 49440 

aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 49500 

ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 49560 

gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 4 9620 

gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 4 9680 

catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 49740 

tataaggacg tttgtcattg ggtttagggc ccacccaaat catccgagat gacctcgtct 4 9800 

tgagatcctt aacctgcaaa gacccttttt ccaaaaaaag gttatgctca cagattctag 49860 

gccttaagac atgggtgtat ctttctgggg ggcactatcc aaccccttat acaatgaaag 4 9920 

acgggaagag ggccaggtgt ggtagttcac gcctgtaatc tcagcacttt aggaagctga 49980 

agcgggagga tcacttgagc ccaggagttt acaagtagct aggcaacatg atgagacccc 50040 

atttctacaa aaagtaaaaa aaaaaaaaaa aaaaaaaaag ccaggtgtgg tggctcacac 50100 

ctgtaatccc agcactttgg gaggctgagg caggcagatc acgaggtcag gagattgaga 50160 

ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaaaa ttatggccgg 50220 

gcgcagtggc tcccgcctgt aatcccagca ctttgggagg ccgaggtggg tgaattacaa 50280 

ggtcaagaga tcgagaccat cttggctaac acggtgaaac cccatcaaga tcacaaggtc 50340 

aagagatgga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 50400 

aattagccgg gcatggtagc gggcgcctgt agtcccagct gctcgggagg ctgaggcagg 504 60 

agaatggcgt gaacccggga ggcggagctt gcggtgagcc gagatcgctc catgccattg 50520 

cactccagcc tgggtgacag agtgagactc cgtctcaaaa aaaaaaaaaa aaagaaaatt 50580 

agccaggcac agtggcaggt gcctattgtc ccagctactt gggaggctaa ggcaggagaa 50640 

tggcatgaac ccgggaggtg gagtttgcag tgagccgaga tcatgccact gcgctccagc 50700 

ctgggcgata gagcaagact ctgtctcaaa aaaaaaagcc aggcatggtg gtgcatgcct 50760 

gtagtcccag ctactcaaga ggctgaggca ggagggttgt tcgacccacg gagatcaagg 50820 

ctacagtgag ccatgatcgc accactgccc tccagcctgg gtgacagagt gtgaccctgt 50880 

ctcaaagtaa gtaaatagga ggagagacaa gtgggcagtt cagactgatg gtatgggcac 50940 

agtagagact ggtgcagaca ggctggcctg tgatgtcaag caacttctgt aactgtttcc 51000 

ggcatccatt tgtgtgtcaa tttccgtgtc agtaggaaga ctctgtaggc tgccaagagg 51060 

aataagtggg aggatcctcc cagagaggcc gggcctgcag gagggccagt tctcatgagt 51120 

tcttatttgg cccctaccct ccaggctgtg gttctgaggt gggagacaga gcctgacctc 51180 

tgtttgtctt gttttgtctt tgcagcagcc cacccatgtg cccgtgacaa tggtggctgc 51240 

tcccacatct gtattgccaa gggtgatggg acaccacggt gctcatgccc agtccacctc 51300 

gtgctcctgc agaacctgct gacctgtgga ggtaggtgtg acctaggtgc tcctttgggg. . 51360 

tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc • 51420 

acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 51480 

ttctactatg tgcattcggt gctatgaatt ttcctctaag tactgcgttt cctgcatctc 51540 

acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 51600 

tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 51660 

tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 51720 

ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 51780 

ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctct 51840* 

gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 51900 

ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 51960 

tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 52020 

tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 52080 

gcgtgagcca ctgtgcctgg ccattaggtg tgttttatca cccagcatca tgcagtttat 52140 

cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 52200 

tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 52260 

cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 52320 

tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380 

tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 524 40 
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aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500 

tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560 

gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620 

ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680 

gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 5274 0 

atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800 

gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 528 60 

cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 52920 

aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 52980 

cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 5304 0 

tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 53100 

acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160 

cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220 

gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280 

agcagagggg acattttgtg actgtccccc tcctgagctt cccagcagct tttctccaag 53340 

ttacagccca aaagctcagg tggatttgca acccaacggt gtctgtgcac ctcccactga 53400 

tgcccgaact gccctggcca agaaacgggg ccgtcagaac gctgcactaa ctgcagcctt 534 60 

gggcctccat gccagaggcc atgcccttcc atccaccacc ccctggcctg ggccctggcc 53520 

ctcctggctc gggaactcca ggccccttcc tcacggatcg agagacgtgt atttaccgca 53580 

caggtgcttg tcattctctt gtggcctctt ctccagggag atcacagaag gacagggcct 53640 

cactgaggtc tcggacatgg accctttgat agtggcagga gccaggctgg gcaagaggcg 53700 

gccacagtca cctcagcagt gccatcacca ccgccattca gcccttccct gagccgggcg 53760 

cgcccctggc tctggcccca gtgtcccagt tacagctcac aggagcttgt ggtgcccagc 53820 

ggctgcttct gattgagagt cgaggtcgga ggctttggga ggctgagagg ctgctcggtt 53880 

tcacaactgc tgagggagac ttgggctcca tctcaggtct gccccatgtc gccctcaacc 53940 

tccagccacc ggtcctccgt gtcccccatg gccaggcacg gcttgcagac atctgtcgtt 54000 

ggctcctctc agccgtcgtg ggctgaccct ggcacgtcct cctgtggctg agcccagtgg 54060 

ggacagctgc ttccttttat taccctagaa ctctcgtctt tgatcaggcc ccctccccta 54120 

tgccacacag tccctgtcac tcgggtgagc ccagtagtca tggggaaggc ctgcgggttc 54180 

caaacatcca aaggcttgcg tgcagcatga cagcttgaaa ccgatgtttt ttaccttgat 54240 

cagatttcag cttggcgggg gctttgctca gctttcagtg aggcctgggc cgatttccca 54300 

gcatcccctc ctgaggccag cctctgtttc ctgtgatttt ctgcacaaag tgggagggag 54360 

gagtcttagg aaatgggggg ccacctcgaa acctaggcct cctctggctt ctctgtgcca 54420 

gtgcccccac gctttgtgtc tgtgtcccca gcccatggga ctgtgttatt ccctgagtgc 54 480 

tgccgcatgc ccagcccgca ctgaggacgt ggagccccga ggggcaggat ggcctccatg 5454 0 

gtcacacgta ggaagtggcc tccaccctcc gatgatcctc tccccccctc cctttcagcg 54 600 

ccttccccgg gggtgtcatc agccctcctg cctgtgcttt gtcccgtctt ctgcaggcgc 54 660 

atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 54720 

cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 54780 

ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 54840 

agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900 

cggagcattt ctgctgtatt tggtgtagcg cctgctgctt aaagctctga ttcccagttg 54 960 

gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020 

tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080 

gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140 

tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200 

gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260* 

gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 55320 

tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380 

cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440 

tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 55500 

aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560 

aggggctctg aaaacctttg cttgcaggga gattgccaag tctgtctttt aggcccaaca 55620 

aggaaaactc tgcagttcca cccatcctgt cccaccaggt agtgtggctt gaaggcagac 55680 

tgtgagggtc tatctcacct tcctgcatta ggtcaggagt ttcacagaaa cctgaggcac 55740 

attcaggggt gggctgcaga ggtccatggc tcacaccctg gaaaatccgc ccccaaaaga 55800 

cagtgctgtc tccactgacc agtctgtggg atagtgctta agcctgagtg gtttctatca 55860 
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acatgtagaa tcaggaggta taaagagatt tgctcaggca tcctgggccc tctctgacca 55920 

gcaggatctt cctttagatc ttgacagtga aacacatctc ttctgtgccc cctgtgagtt 55980 

ttctttcatt cattcattca ttcattcatt cattcattca ttcgagacag agtcttgctc 56040 

tgtcacccag gctggagtgc cctggtgtaa tctcggctca ctgcaacctc tgcctccagg 56100 

gttcaatcga ttctcctgcc tcagcctccc gagtagctgg gatgacaggt gcgcaccacc 56160 

atgcctggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggccaggctg 56220 

gtctcgaact cctgacctca ggtgatccgc ccgcctcagc ctcccaaagt gctgggatta 56280 

caggcatgag ccaccgcgcc cggcctgagt tttcctttta tgaaggacct gcttggttgg 56340 

ttgcctgcca catgttgtca gcaccatggg cccaggactg ctgaggagct gttgatgccc 56400 

tcgctctccc agagccaccg gctctgttag ataattcaca tgcagtctgg ccactgtcct 564 60 

acgtcctcat tcacaaagag cagacatttc gtagaagatg agggcctggg agtaacctcc 56520 

ctgcatgttt ttctataaag gcatagtggt taagtccttc cagctcattg accattggag 56580 

aattttatgg aggctgtaga ctaggggctg gtaaactaag ggcccagggg ccaaatccag 56640 

cctgccacct acttttgtaa ataaagtttt cttggtgcac agccatgccc attcattcat 56700 

ttgcacaatg tctgtggctg ctttcatgcc aaaagcagga gaactgagtg gttatgctgg 56760 

agacctacgg ccttcaaagc cccagacctc acgtctggcc cttgacagac agagcttccc 56820 

cagccctgct gcgcatcctg gcccagcatg tgctgtgtgt gtgatttcag cttgcaggag 56880 

ccgtggttag gaattgtccc tgtgttggtc cattttgcat tgctatgaag gagcacctga 56940 

ggccgggtag attatgaagg aaagaggtct gtctggctca tggttctgta ggcagcacca 57000 

gtatggcacc cgcatctgct cagcttctag tgaggtctca ggaagctttg actcatggtg 57060 

gaagtcgaag cgggagcagg tgcatcacat ggtgagagag ggagcaacgg agagagagag 57120 

agagagagag agagcgcctc tccctcttgc cctcaccttg agaggagatg ccaggctcct 57180 

ttaagtaacc agctcccatg tgaactcaca gtgagagccc atttgctact gcggagaggg 57240 

caccaggcat ctgctcccat gacccaaaca ctgcccacca ggccctacct ccaaccttgg 57300 

ggtcatattt tattctgttc tatgctatgc tatgctatgc catgccatgc catgccatgc 57360 

tattcctatt ctattatttg agacagaatc tcgctctgtt gcccaggctg gagtgcagtg 57420 

gcatgatctt ggctcactgc aacctccacc tcccaggttc aagcgattct cctgcctcag 57480 

cctcccgagt agctgggatt acaggcacac accaccacac ccgggtaatt tttgtatttt 57540 

caatagagat ggggtttcac catgttggcc aggctggtct caaactcctg gcctcaagtg 57600 

atccacttac ctcggcctcc caaagtgcca tgattacaga tgtgagtcac tgcgcccagt 57 660 

gagggtcaca tttccgttga gatttggagg ggcagacgtt ggagccatct gagccccctc 57720 

gtcccgctct agcttctcct cccgtgtgcc ccgcggtgct ggtggcaggc ccttacgccg 57780 

gttctggctg cacgctctgt tccagaagct ttcttccctg cttggttacc agaaaatcat 57840 

cccatccatt acaaggacag ggtcccctta tctcccattc ccagggcagg acaccggggg 57900 

cagggcaggt ggggaactga gcaagttctc tgggggcagg cgtggctatg gctccctctg 57 960 

ggtgggcgtc tggggagggg tggaggcagc cgtcagcgcc ctggcttgct cttcctccct 58020 

ggccagagac tgtggccttg tgctgctccc gtgtgggctg cctgcacctc cagtgggttg 58080 

tgctccctcc cctcccctcc cctcaagctc tgctgagcac cactgccttc cacagccccc 58140 

actctcggga ggcgaggctc ctcgtggcca ttcctgtcct tggcacccac ccccccacca 58200 

acctggtaga gccttgggcg gggtctgtta ctccttgcat ggcgtagacc tccccacagt 58260 

aggcacctga cacatacctc ctggggggca ggcaggaggt gcgttgaggt ctcagccctg 58320 

gcagtccctc ccctgcgtgg cataggcctc gccacagggt catcgagggt gggtggagac 58380 

tgtactagac cactccccgc tggtcctaga aagggtccca tctgtctgct ctctgtttgg 58440 

agtccagacc ttggttgctg tgccctgcat ggtgggctgg ggggcaccct ccagcctctc 58500 

tgagtgcatg gcctctcctt gcagccatct gcctgcccaa ccagttccgg tgtgcgagcg 58560 

gccagtgtgt cctcatcaaa cagcagtgcg actccttccc cgactgtatc gacggctccg 58 620 

acgagctcat gtgtggtgag ccagcttctg gcacggggaa ggggcgtccg ggctgggttc 58680. 

ccccaggaac gtggagttta ggggaggaga cgtgcctttc cagcggggct gggggctgtg 58740 

tgggagactc aggcggctgg gaggctcctt gcgggaggca gggaagcctt tcccagggca 58800 

gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 58860 

gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920 

agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980 

actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 59040 

tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59100 

gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 59160 

aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220 

ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280 
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cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 59340 

tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400 

gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 594 60 

gggccccagc tgtgggggcg ctgttgatgc gcagccaggc ctcgccgcca gagcccgcac 59520 

gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580 

gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 59640 

tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700 

gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760 

gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 59820 

gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 59880 

ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 5994 0 

agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000 

gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 60060 

ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 60120 

gcagtgccat cgggcccgtc attggcatca tcctctctct cttcgtcatg ggtggtgtct 60180 

attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 60240 

acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 60300 

agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag 60360 

agtagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 604 20 

atgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 60480 

gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 6054 0 

taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600 

tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660 

gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 60720 

gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780 

tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60840 

ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900 

accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960 

tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020 

attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080 

ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61140 

ctgtgctgcc ccgcctgggt tccctgcccc agtcgcaggc cccttggagg aggtaccatg 61200 

tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260 

gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320 

ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 61380 

ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440 

ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 61500 

aggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560 

cggggccggg gaggggcggg gcgggatggg gctgtgggcc cctcccaccg tcagtgctgg 61620 

ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680 

tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740 

tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800 

ctgttgggct ccgtcagcat ccaggcggcc gtctgtgttc cgcacggccc agcccatcga 61860 

tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920 

aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980 

gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 6204 0 

gggtgagggg ctggctccag ctgagccacg cct'tgagtga aatgactgtg aggagaataa 62100 . 

actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160 

acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220 

cgggggctga cttgacatgc tttgtatgct tttcatttgt tcctgcagct gtatgcccct 62280 

aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340 

ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 62400 

ttatttttct tttttttttg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460 

ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 62520 

cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580 

tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62640 

tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700 
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gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 627 60 

ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62820 

cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880 

tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctggt ctccaactcc 62940 

tggactcgag cgatcctcct gcctcggcct ccccaggtgc tgggattaca ggcgtgagcc 63000 

accgtgcctg gcctggggta ttgtcttctt atggcacctg actgtggtgg gccctgggaa 63060 

ggaagtagca gaagagggtt cttcttggtt tcctggacag taactgagtg ttctggaggc 63120 

cccagggcct ggctttgttt agggacaaag ggaactggta accagaagcc gagagtttaa 63180 

acacccactg cccttcttcc ctgctcctgc tgctgcaacc cagcttaacc agccaggagt 63240 

gctaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaat tctcttttcc 63300 

tgttctccct tgggagctgg gaggatctta atcaggcaat aagagatggc actgagcagc 63360 

cagctaattt tttaaatcac tttattgttt aaccatatga ctcacccact taaaaaaggg 634 20 

tacagttcag tgggttttag tgtattcaca gatgtgtgca accctcacca cagttaattt 634 80 

tagaacattt tcctgcccct aaaagaaact ctgcatgaag ccagctg.ttt ttaaattagc 6354 0 

aaagttattt tgcatccttt aaatatatgt tcatggtaca aaattcaaaa gatacagaag 63600 

agtctgcagt ccaaagagac tccgccccca tgacgccaag caggactccc tgggaggcat 63660 

ggcctcctgc agtgtgtttc ttctatgtcc ccccaggggt catctgtaca tatgcaagca 63720 

tacaagagcg tggactttgt tttccaagcc agaagataat tgtagattta tgtgcagttg 63780 

tgagaaagag cacagaccca tttatcctct gcctggtttc ccccagtgct gcctgccatc 63840 

ttgcatgact tccattccta tcataagcaa gacactgata acgattcttt caccttattc 63900 

agattgacat aagtgttttt tgtttgttct tgagacaaac ttcctctgtc acccagtggg 63960 

agtgcagtgg cacaatcaca gctcactgca gcctcaaact cctgggctca agcgattctc 64 020 

ctgcctcagt cccctcaagt agctcagatg gcaggtgtgc accatcatgc caggctaatt 64080 

tttaaatttt ttgtggaggt gaggcctcac taaatttcct gggctagtct tgaactcctg 64140 

agctaaagtg atcctcctgc ctcagcctcc caaagtggta ggattacagg catgagccac 64200 

tgcgcctggg ctgacatatg tgttttcgta agcccgaaag atagcatctg aagagtcaac 64260 

attgagcctt gccttttgct gctaatgatg tataaaagct gctgttctga gcatttcgga 64320 

ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 64380 

ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa. gaactgagca tgaaccactg 64440 

gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 64500 

actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560 

tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64 620 

tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64 680 

tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64740 

ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64800 

atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 648 60 

cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64 920 

ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64980 

atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 6504 0 

gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65100 

gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160 

tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220 

ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280 

cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 65340 

gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 65400 

taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 654 60 

tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 65520. 

gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgccc 65580 

agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640 

ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 65700 

ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760 

tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65820 

cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65880 

ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 65940 

gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 66000 

ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060 

tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 66120 
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caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 66180 

ggggcctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240 

aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 66300 

gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 66360 

taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 66420 

tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 66480 

aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc aaatcctgag 66540 

gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 66600 

tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660 

atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 66720 

tttatttaat tttaaattta ttttttgaga cagagtctca ctctgttgcc caggctggag 66780 

tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 668 40 

acctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacccg gctaattttc 66900 

atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960 

tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 67020 

agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080 

aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgttcttgta ccaccctagg 67140 

gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 67200 

ggggctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 67260 

accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 67320 

gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 67380 

tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 674 4 0 

cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 67500 

actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560 

cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67620 

ggaggggcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 67680 

gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 67740 

gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 67800 

cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860 

gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67 920 

tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980 

aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68040 

tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100 

tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 68160 

aggtaggaca tcccctgcag ccctccatgg ccattgggtt cccgccagcc cgtggtggag 68220 

gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280 

gggcccagcc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 68340 

tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 68400 

tcaagttgct cctatggggg ttacttttaa gctgggaaat ggctgtggcg tcgaggggcc 684 60 

gggggcttgg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 68520 

tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580 

tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 68 640 

cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700 

ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 687 60 

tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820 

tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 68880 

cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 68940. 

gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000 

cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060 

gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 69120 

cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 69180 

g r aatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240 

tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 69300 

ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac 69360 

acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 69420 

cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 69480 

acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 69540 
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gttgcttgaa ctcaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600 

cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660 

actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 69720 

agtcacatct tacatggctt cagggaacag acagcatgag aaccaagcga aaggggtttc 69780 

cccttgtaaa accatcaagt ctagtgagat ttattcacta ccacgagaac agtatggggg 69840 

gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900 

tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 69960 

acatttacag taacagtgtt aggtgaacag ■ ttgtccagtc tcctgttttg tcggacactg 70020 

tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080 

catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 70140 

aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 70200 

ggcttggcag gctgcagggc aggtgcgggc atgcccatgg tagaggtggg accattgagg 70260 

ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 70320 

gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380 

gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 70440 

ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgtgacag 70500 

cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 70560 

agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70620 

cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680 

ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740 

agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatataattg 70800 

ggattttaaa aacatgagaa atgtgaactg tgatggggtg ggcagggctg ggagaacttt 70860 

gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920 

ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980 

gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 71040 

ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71100 

tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71160 

cgtcttcctc agattcttag tctttttttt tttttttttt ttttgagacg gagtctttct 71220 

ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 71280 

ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340 

cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 71400 

ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460 

ttataggcat gagccactgt acccagctga ctcttagtca cttttaagaa ggggactgtg 71520 

ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580 

gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640 

aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 71700 

ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 717 60 

acctgacgcc tgctgtgggc ccatccctgc ggtgctggag aagacagaca aggcctggtc 71820 

actgcctctg cagggtcccc agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880 

ggaagctgag ctgttctcgt gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940 

gggagaaaat gcacatcgca tcagtggaga agggcctgac ttccctcagc atggtggagg 72000 

gaggtcagaa aacagtcaag cttgagtatt ctatagtgtc acctaaata 72049 



<210> 10 

<211> 8705 

<212> DNA 

<213> Homo sapiens 



<400> 10 

ggactcaggg gcagcaggga ggtacaccca tggttagtgg gcggaccata gggggtaatg 60 

agagggtgaa tcgatggaac ctgggggaca caatcgaagt ggttccagag tcgggctgta 120 

ctaattaaag agacggggca gtggacaggc attttcagtt gactgcccag ggagtgttct 180 

gcccaacagg gaggatatgc gtacagaatc atactcgatc agcatgagtc caattcagac 240 

cgtacatcag tggagatatg ggtcccccga tgactccgtg gaacactgat gtttgtgaca 300 

ggggagtaca gcaccagcca tcagcaggcc agtaaatcat accggcctgc gaaattggac 360 
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tcagacccgg atccaccctg accgacgtcc caagccccca ccccccaccc cccaccatgg 420 

gccgagatcc agtcctcttt gaatagggcc tggccgtggt tcacgggaca tctgagacat 480 

tgccgaggcg ctgcattggt ggatcttgcc agaagtttgc ccagtgcaga tttgggcaga 540 

atctcaaact gccttgggat gtaggagaga aaccaggcct ggtcaagttc atgggaagag 600 

gtggaaacag accccatagg ctggggcttg ggcagctgta ggaagccctc tctgctgcct 660 

ccctgcctgc tctctgcttt gaagcatctt ccccagtgcc cccagtctca tgccctctca 720 

acgttggggt caaatcctga ggaataccca gactggctct ctgggccaaa gaggaccctc 780 

tccagaaaga gcagggccca gtgcggcttc ctaaagggca ggggaagggc ctggccactc 840 

cccagaggct actcaccagc catcaggata gccccaggaa gcaggccttc tcgagcccat 900 

tttattactt tattttatta ttttatttaa ttttaaattt attttttgag acagagtctc 960 

actctgttgc ccaggctgga gtgcagtggt gcgatctcaa cccactgcag cctctgcctc 1020 

cagggttcaa gggattctcc cacctcagcc tcccaagtag ctgggattac aggtgcccgc 1080 

caccacaccc ggctaatttt catattttta gtagagatga ggtttcacca tgttggccag 114 0 

gctggtctcg aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtgctagg 1200 

tcaagcccat tttaaagttg aagaaactga ggctgaggta aattccctcc ccagggatcc 1260 

tgctgcagcc agaaggtggt aaaacaggac ttcacccggg tctgtctggc gtgaaaggca 1320 

gtgttcttgt accaccctag ggggcctgag agaactgagt ccctcgggca taactgacag 1380 

ttctgttccc attattccgc aggggctcgg atctggctgt atgctttcca ggatggcctt 14 40 

ggagacccac ataagcccta caccctttgg gaagctgcat gttgggttgg ggtgccgtca 1500 

gtggcacttg tggaaggtgc agacctgtgt gggtgtgtgg gcccagggcc cctggtccct 1560 

tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 1620 

aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 1680 

ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 1740 

ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 1800 

agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg 1860 

aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 1920 

ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 1980 

cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 204 0 

aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 2100 

cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 2160 

ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 2220 

ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 2280 

ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 234 0 

ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 2400 

tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 24 60 

gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 2520 

aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580 

ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640 

tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700 

cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 27 60 

ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2820 

caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880 

ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 294 0 

ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3000 

cgcaggggct ggatcagagc ctgggatggg cagggtgagc ctcctgacct ttaacccagt 3060 

ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 3120 

ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3180 . 

gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240 

acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300 

ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360 

ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 3420 

ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480 

catcttcccc ccgcccgccg tctgacctca cagcagctgg gccccaagag gagtatcctg 3540 

tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600 

ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3660 

gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 3720 

aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780 
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gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg 384 0 

taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3900 

ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960 

gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4020 

accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080 

tcccacagca cgtgggaatt atgggagtac aattcaagat gagatttggg tggggacaca 4140 

gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggtgaaca gttgtccagt 4200 

ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260 

ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 4320 

ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380 

aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440 

gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500 

ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4560 

tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4 620 

tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 4 680 

cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 4740 

cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800 

cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4860 

cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4 920 

cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4 980 

tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5040 

gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 5100 

cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 5160 

tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 5220 

ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280 

agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 5340 

cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400 

ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 54 60 

tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 5520 

tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580 

tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 5640 

tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700 

cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 5760 

ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 5820 

gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 5880 

ctgctgagcc cactctgcca caccacgtgc ccctggcagg gagggaatcc ctgggtcctc 5940 

acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 6000 

agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 6060 

gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 6120 

gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 6180 

acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 624 0 

tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 6300 

ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 6360 

atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 6420 

aatcccagct acttgggagg ctgaggcagg agaattgctt gaacctggga ggcggaggct 6480 

gcagtgagcc gagattgtgc cactgcactc cagcctgggc aacagagcaa gactctgtct 6540 

caaaaaaaaa aaaaaaaaaa gcaatacaaa atacaaatat cactttcact aaaagaaggg 6600 . 

atggaagacc caaaacaaac agaaaacaac aaaatggcag gagtaagtcc ccacttatca 6660 

ataataacat tgactgtaaa taggctaagc tctgcaatca aaagagtggg ccaggagcgg 6720 

tggctcacgc ctgtaattcc aacgctttgg gaggctgagg cggatggatc atttgatgtc 6780 

acgagtttta agaccagcct ggccaacaag gtgaaacccc atctgtacta aaaatacaaa 6840 

aattagccag gcggtagtgg cacgcacctg taatcccagc tacttgtgag gctgaggcag 6900 

gagaatcact ggaggctggg aagcggaggt tgctgtgagc caagatggag ccactgcact 6960 

cccacctggg cgacagagtg agatcctgtc ttaagaaaaa aaagagtgga tgaatggatc 7020 

aaaaaacaag acccaaccat ctcttgcata caagaaacac actttaccta taaaaacaca 7080 

ctaggccagg tgtggtggct cacacctgta atcccagccc tttgggaggc ctgactggca 7140 

gatcacctga ggccaggagt ttcagaccag cttgaccgac atggcaaaac cccatctctc 7200 
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ctaaaaatac aaaaaaacaa aaaaaagaaa aaggctggaa gtagtgatgt gtgcctgtag 7260 

ccccagctac ttgggaggct gaggcaggag aattgcttga atccgggaag tggaggttgc 7320 

agtgagccag gatggtgcca ctgcactcca gcctgggtga cagagcgaga ccctgtcata 7380 

aaaaaaaaaa gaaaagaaaa gaaaaacgag aaaaacaaac acaaaattag tagaagaaaa 7440 

gaaataataa agatcagaac aggccaggct catgggcaca gtggctcaac tcctacctgc 7500 

tcaggagttt gagaccagtc tggccaacat ggcaaaaccc catctctcct aaaaatatga 7560 

aaaaaaaaaa ataggctgga tgtggtgatg tgtgtgtgcc tgtagcccca gctacttggg 7620 

aggctgaggt gggagaatca cttgagccca ggaagtggag gctgcagcga gtcatgaatg 7680 

caccctgcac tctagctggg taactggagt gagattctgt ctcaaaaaag caaagaccag 7740 

agcagaaata aatgaaatgg aaatgaagga aacaatgcaa aatgatacaa aaagtttttt 7800 

cgaaaagata aacaaaatca acaaaccttt agccagatta agaaaaaaag agagaagacc 7860 

caaataaata aaatccgaga ttaaaaagga gacattacca ctgataccac agaaattcaa 7920 

aggatcatta gaggcaacta tgtgcaacta tatgctaatg aactggaaaa cctagaagaa 7980 

ctgggtaaat ttctagacac atacaaccta tcaagattga accatgaaga aatccaaaac 8040 

ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga 8100 

tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat 8160 

aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc 8220 

aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 8280 

ctccagcctt ggcgacagag ccagactcca tctcaaaaaa attaaaataa caaaaacctg 8340 

aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa 8400 

gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 84 60 

gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520 

tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta 8580 

gatggtactt gtcaaacaag gcctggctaa atttctatat ttccttcaag tagaagatgt 8640 

gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 8700 

aaaag 8705 



. <210> 11 

<211> 66933 

<212> DNA 

<213> Homo sapiens 



<400> 11 

tataatcaag 

ctcaagaacg 

agcaatagta 

agcatggttc 

atgagaactg 

gctctttcta 

tgggagatta 

cggttgaaga 

gcgtaggagt 

agtttactct 

gtaatgaacg 

ttctactgac 

ttgccgatgg 

tggttggcga 

gcccaacaag 

ctggaaatat 

gtatcaacac 

tatctgcccg 

agcagcaggc 

aagttatcac 

gctcacgaca 

ttaacctgga 



cgcgttccgt 
ttgatcttaa 
atggctctca 
taaaaaatgt 
tttttgaaca 
tttgggaacc 
gataatgaag 
tacttcgtta 
aatggctcgc 
tgaagtgctc 
tgaccaggag 
tggtcaacag 
gagtcgccgt 
gctggatgat 
tgcttatgaa 
ttctgcgctg 
cgccaaattg 
gtcaggtgat 
atctaacctt 
tcttttaact 
tcagtttgct 
caggtctcgt 



ccagtccggt 
agggttcgag 
gtccccgtgg 
tgtacgtgaa 
ggccattgat 
tgtctgcaat 
cgtgcgcctg 
tcgacaccag 
ggtaatgcca 
cggggtgata 
ctgcttactg 
acaccggcgt 
cgtaaagctg 
gagcagatgg 
cgtggtcagc 
gctgatgcgg 
cctaaatcag 
gcacttcaaa 
catgagcaga 
tctgtgctta 
cctggagcga 
gttccaactg 



gggaagattt 
cctgatgtac 
atggaggagc 
acggatgaag 
caacgctctt 
gaaattttcg 
ttattccaaa 
ctgccccgat 
ttactttgcc 
gtgttgagaa 
aggacgcact 
tcggtcgaag 
ctgcacttac 
ctgcattatc 
gttatgcaag 
aaaatatttc 
ttgttgctct 
aagcctttac 
aaaaagctigg 
aaacgtcatc 
cagtattgta 
agtgtataga 



tcgatatgct 
gtattttgct 
aaattcggga 
ttggtaaagg 
caactggtgc 
atcgtctgat 
acatacgctc 
ggtggattcg 
tgtatgtggt 
gacctctcgg 
ggatgatctc 
agtatctggt 
cgaaagtgat 
cagattgggt 
ccgattgcag 
acgtaagatt 
tttttctcac 
agataaagag 
ggtgatattt 
tgcatcaaga 
taagggcgat 
gaaaattgag 



tcgtgatctg 
taccaaatac 
tgcctgggga 
tcagatccgg 
ctggagaaat 
taaaccacgc 
aatactcaac 
ttaattgcgc 
cgggatgtga 
gtatggtcag 
atcccttctt 
gtcatagaaa 
tatcgtgttc 
aacgattatc 
aatgaatttg 
attacccgct 
cccggtgaac 
gaattactta 
gaagctgaag 
actagtttaa 
aaaatggtgc 
gccattctta 
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aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380 

tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 1440 

ccactgttcc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500 

tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg tctgattatt 1560 

agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 1620 

ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 1680 

ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 1740 

accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800 

atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 18 60 

cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 1920 

tccatcaatg cctgtcaagg gcaagtattg acatgtcgtc gtaacctgta gaacggagta 1980 

acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2040 

attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100 

ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160 

ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 2220 

tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 2280 

tccacgcacg ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2340 

atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 2400 

tttaaggcgt ttccgttctt cttcgtcata acttaatgtt tttatttaaa ataccctctg 24 60 

aaaagaaagg aaacgacagg tgctgaaagc gagctttttg gcctctgtcg tttcctttct 2520 

ctgtttttgt ccgtggaatg aacaatggaa gtccgagctc atcgctaata acttcgtata 2580 

gcatacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agcgggtgtt 2640 

ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 2700 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 27 60 

tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2820 

cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880 

tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2940 

gaattcgagc tcggtacccg gggatcctct agagtcgacc tgcaggcatg caagcttctc 3000. 

ttgtgccggt tgtacgctgt caggtcacac tggtgagtta ggcagggcac agatgcccag 3060 

agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3120 

gccctcagct agagaggggg cttttatttg agaccagaat cacctgagca tcctcctgtc 3180 

cccagctgtg tccagcctgt ctgcagggac atcctgagag gaccaggctc tcccctcatc 3240 

cacctgccta agtgccactc tgaaccctgt ccacctgtgc cgtggagggg cgtgacctca 3300 

agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360 

ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 3420 

ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480 

ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 3540 

gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600 

cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660 

ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720 

gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc ' taggagcccc 3780 

ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3840 

tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3900 

gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3960 

ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 4020 

cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080 

tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140 . 

catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200 

gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 4260 

ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccad 4320 

aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 4380 

cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 4 4 40 

cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4 500 

cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4560 

cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4 620 

ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4 680 

ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740 
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gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 4 800 

ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 4860 

ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 4 920 

acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 4 980 

tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca 5040 

tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 5100 

cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 5160 

gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 5220 

aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 5280 

ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 5340 

caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 5400 

catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 54 60 

ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 5520 

cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 5580 

gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 5640 

gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttcctgggg 5700 

gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 5760 

ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cctctgtgga 5820 

gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 5880 

ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 5940 

tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 6000 

cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 6060 

caggcagcgg gagcacagca ggcagagggc ggggctcgag cagccatccg tggaccctgg 6120 

ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 6180 

cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 6240 

ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300 

aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 6360 

ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 6420 

cttggcagtc tgacctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 6480 

aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 6540 

ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 6600 

cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 6660 

tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 6720 

gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 6780 

ctcttggccc ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 6840 

catccctgtc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 6900 

cagtcctcca gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 6960 

attgaccgac accgacattt accgacactg acatttacca acactgttta ccaacactga 7020 

catctactga cactggcatc taccaacact gacatttacc gacactgaca tttaccaaca 7080 

ctatttacca acactgacat ctactgacat tggcatctac caacaccaac atttaccgac 7140 

accaacattt accaacactg aaatttaccg acaccgacat ttaccgacac cgtttaccaa 7200 

caccgacgtt taccgacacc gacatttacc gacactgata tttaccaaca ctgacatcta 7260 

ctgacgctgg catctactga caccgatgcc agcatctacc aacaccgaca tttaccaaca 7320 

ctgacattta ctgacactga tatctactga cactggcatc tactgacacc aacatttacc 7380 

aacaccagca tctaccaaca ccgacattta ccaacaccag catttaccaa caccgatgtt 7440 

taccaacgcc gacgtttacc gacgccagca tctaccaaca ctgacattta ccgacaccga 7500 

catttaccga cactgacatt tactgacact gacatctact gatactggca tctaccgaca 7560 

ctgatattta ccaacgccag catctactga cactgatgtt taccaacacc gacatttacg 7620 

agcaccgaca tttactgaca ccaatattta ctgacatcaa catttagcca tgtgatgggg 7680 

gccggcttgg gggcaggcct tgctcttggc actggggatg ctgcagagac cagacagact 774 0 

catggggtca tggacttctg cttcttctcc agcctcatgt actggacaga ctggggagag 7800 

aaccctaaaa tcgagtgtgc caacttggat gggcaggagc ggcgtgtgct ggtcaatgcc 7860 

tccctcgggt ggcccaacgg cctggccctg gacctgcagg aggggaagct ctactgggga 7920 

gacgccaaga cagacaagat cgaggtgagg ctcctgtgga catgtttgat ccaggaggcc 7980 

aggcccagcc accccctgca gccagatgta cgtattggcg aggcaccgat gggtgcctgt 8040 

gctctgctat ttggccacat ggaatgcttg agaaaatagt tacaatactt tctgacaaaa 8100 

acgccttgag agggtagcgc tatacaacgt cctgtggtta cgtaagatgt tatcattcgg 8160 
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ccaggtgcct gtagacacag ctacttggag actgaggtgg gaggatcgct ggagtccaag 8220 

agtttgaggc cagcccgggc aaaggggaca caggaatcct ctgcactgct tttgccactt 8280 

actgtgagat ttaaattatt tcacaataca aaattaagac aaaaagttaa tcacatatcc 8340 

actgccctgc ttaagacaga aaacatgggt gttgttgaag ccagaggcag ctgctggcct 8400 

gagtttggtg attggttcct aagcagttga aggcagtttt gtttttccat agatgtctgt 84 60 

tctccctttg ctgggtgcag cctcgccctg ctgctgtggt cgggtttcag tggcctcgtc 8520 

ccgtggacgc agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac 8580 

gcagcctcgc cctgctgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc 8640 

gccctgccgc tgtggtcggg tttcagtggc ctcgtcccgt ggacgcagcc tcgccctgcc 8700 

gctgtggtcg ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac 8760 

ctgtggagcc tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat 8820 

tgtttgtttt tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct 8880 

cactgaaacc tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc 8940 

tgggattaca agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg 9000 

gtttcaccat gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc 9060 

ggcctcccaa agtgttggga ttacaggcgt gagccaccgc gcccagcctc tgttgagcat 9120 

attttgaggt tctcttggtg ccagtgatat gtacatgtgt ccccatcgca ccatcgtcac 9180 

ccattgaggt gacattggtg cctctcctcg gggtggatgc ctccctctgt ttccagcaac 9240 

ttctgaagga ttttcctgag ctgcatcagt ccttgttgac gtcaccatcg gggtcacctt 9300 

tgctctcctc agggctccca ggggaggccc gaatcaggca gcttgcaggg cagggcagga 9360 

tggagaacac gagtgtgtgt ctgtgttgca ggatttcaga ccctgcttct gagcgggagg 9420 

agtctcagca ccttcagggt ggggaaccca gggatggggg aggctgagtg gacgcccttc 9480 

ccacgaaaac cctaggagct gcaggtgtgg ccatttcctg ctggagctcc ttgtaaatgt 9540 

tttgtttttg gcaaggccca tgtttgcggg ccgctgagga tgatttgcct tcacgcatcc 9600 

ccgctacccg 'tgggagcagg tcagggactc gcgtgtctgt ggcacaccag gcctgtgaca 9660 

ggcgttgttc catgtactgt ctcagcagtg gttttcttga gacagggtct cgctcgctca 9720 

cccaggcgag agtgcagtgg cgcaatcacg gctcgctgta gcctcaatct ccctgggctc 9780 

aggtgatcct cctgcctcac cctctgagta gctgggacta cagacacata ccaccacacc 9840 

cagctagttt ttgtgtattt tttgtggggg gagatggggt ttcgctgtgg tgcccaagct 9900 

gatctcaaac tcctgaggca caagcgatcc acctgcctcg gcctcccaaa gtgctgggat 9960 

gacaggcatc agccgtcaca cgcagctcaa tgattttatt gtggtaaaat aaacatagca 10020 

caaaattgat gattttaacc attttaaagt gaacagttca ggctgggcgt ggtggcttat 10080 

gcttgtaatc ccagtacttt gagaggctga ggtgggcaga tcacctgagg tcaggagttt 10140 

gagaccagcc tggccaacat gatgaaatcc agtctctact aaaaatacaa aaattagccg 10200 

ggcatggtgg caggtgcctg taatcccagc tactcgggag gctgaggcag gagaatcgct 10260 

tgagcccggg aggtggaggt tgcagtgatc tgagatcatg ccactgcact ccaatctgtg 10320 

tgacagagca agactctgtc ttgaaaaata aataaataaa aaaaatttta aaaagtgaac 10380 

aattcagggc atttagtatg aggacaatgt ggtgcaggta tctctgctac tatctacttc 10440 

tagaacactt tcttctgccc tgaaggaaac cccatgccca ccggcactca cgcccattct 10500 

cccctctctc ccagcctctg tcaaccacta atctactttc tgtctctggg ggttcacttc 10560 

ttctggacgt tttgtgtgac tggaatcctg caatatgtgg tccctgcgtg tggcttcttt 10620 

ccatagcatt gtgttttcca gattcaccca cacattgtcg cacgttatca gaatctcatt 10680 

cctgactggg tgcagtgggt taggcctgta atcctaacat tctgggaggc caaggcggga 10740 

cgatcacttg aggcaggagt ttgagaccag cctggccagc ctagcaagac cccagctacc 10800 

aaaaaatttt aaaagttaac tgaacgtggt ggtggtgggc acttgtggtt cccagctacc 10860 

tgggaggctg aggtgggagg atcgcttaag cccaggaggt caaggctgca gtgagctatg 10920 

atcgcaccac tgcactccag cctggacaac agagcaagac cctgtctgaa aaaaaaaaca 10980- 

aaaaaaaaag ttcctttctt tttgtggctg gatgacatcc cattgtatgg ccacagcaca 11040 

ttttgtttgt ctgtttatcg ggtggtgggc agtggtttcc accttttgtc tcctgtgaat 11100 

aatgctgctg tgaacatttg aattcaagtt tttgtttgaa cacctgttgt gaattatttg 11160 

gatatatgtg taggggtagg attgctgagt cctatggtaa tgttaggttt gacttactga 11220 

ggaaccatta aactgttttc aacagtggct gcgccgttct gcatccccac cggcagtgtg 11280 

tgagggttct gactttacct cctcacaaac gcttcttttc catttaaaaa aatattcagc 11340 

caggtgctct ggctcacgcc tgtaatccca gcactttggg aggccgtggc gggcggatca 11400 

cctgaggtca ggagttcgag acgagcctgg ccaacatggt gtaaccccat ctctaccaaa 114 60 

aatataaaaa ttagccgggt gtggcagcgg gcgcctgtaa tcccagctac ttgggaggct 11520 

gaggcaggag aatcacttga acccgggagg cagaggttgc agtgagccaa gatcgcgcca 11580 
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ctacactcca gcctgggtga caagagtgaa actccatcta aaataaaaca aaaataaaaa 1164 0 

taaataaaaa tttattaaaa cattcatcac agccagccta gtgggtgtcc catgtggctt 11700 

tgcctcgcat ttccctgata actaggatgc tgagcgtctt gtcccaggct tgccacacct 11760 

cagcactttg agatacgtcg cacagtcccc atttgcgaac gagaaatgag gtttagggaa 11820 

cagcagctgt gtcatgtcac acagcgagca gggggtctct gagccgtctg accccacagc 11880 

cgaccaagct ccaatcctta ccgcctccta gtgttgtgga tgtagcccag ggtgctccca 11940 

catttttcag atgagaacac cgaagctcaa aacaggagcg ttttgtccac attggataca 12000 

cgatgtctgt ggtttggtcc tgaagtcact ttatatctca gtggtccaga ctggagtagg 12060 

acagggggtt ctggggaatg gggaaggtgt ctcaggtgaa aggaaggaat tccagattct 12120 

ccatactgtc cttgggaagt tagaagactc agagggtctg gcaaagtcag acaaagcaag 12180 

agaaatgcag tcaggaggaa gcggagctgt ccaggaacag gggggtcgca ggagctcacc 12240 

cccaggaact acacttgctg gggccttcgt gtcacaatga cgtgagcact gcgtgttgat 12300 

tacccacttt tttttttttt ttgaggtgga gtctcgctct cttgcccagt ctggagtgca 12360 

gtggcacgat ctcggctcac tgcaagctct gcctcccggg ttcatgccat tctcctgcct 12420 

cagcctcccg cgtagctggg actacaggcg cctgccaccg cgcccggcta atttttgtat 124 80 

ttttagtaga gatgggattt cactacatta gccaggatgg tctcgatctc ctgacctcat 12540 

gatccgcccg tctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccgg 12600 

cccgatttcc cactttaaga atctgtctgt acatcctcaa agccctatac acagtgctgg 12660 

gttgctatag ggaatatgag gcttacaggc catggtgctg gacacacaga agggacggag 12720 

gtcaggaggt agaagggcgg agagagggaa caggcggagg tcacatcctt ggctttcaaa 12780 

atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12840 

ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900 

aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12960 

gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 13020 

tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13080 

agtgagccga .gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 13140 

aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 13200 

agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 13260 

cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 13320 

gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 13380 

cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440 

acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500 

cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560 

ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620 

cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680 

ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740 

gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13800 

gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13860 

tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13920 

gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980 

gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14040 

gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100 

ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160 

acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 14220 

gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 14280 

gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14 340 

agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 14 400. 

tatgaaatga ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg 14 4 60 

agcggagacg gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt 14520 

cctgggaatc agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc 14580 

cccatcctgc gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag 14 640 

ccctgggcac cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt 14 700 

gcttggagcc gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg 147 60 

aggtcccctg attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt 14820 

ttttccagcc tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc 14 880 

cttagccctc ttcctgagtc tagagccccg ggggttggaa gttctggccc ctgggacacc 14 940 

tgcagccaca ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc 15000 
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acgttcttgc 
cccctctgcc 
ttttatatcc 
gtgggtctct 
gctggaggga 
ccctctcccc 
ccttggcctt 
ctggaagctg 
ccatcaccgt 
ctcccagagc 
acggggactc 
gacaggcacc 
tgggacaggg 
gggctttggc 
tgtgtgtgtg 
catgaagcaa 
cttcttccct 
tgggctcatg 
ctgcacagca 
agcacagcta 
cggccggccc 
ggaatgggac 
gctggaccat 
acaagtgcac 
tgccacctgt 
aaaaaaaaaa 
aaaattatat 



ctccccgccc 
ctgcccttct 
tgtgccacaa 
gaggcccacg 
cagcccagcc 
tggaagccgg 
ccagggaagg 
tggcccttgt 
catcatcacc 
agagaccctc 
aggacgtacc 
cctgactttt 
tctgttggct 
actgcagagc 
tgtgtgtgtg 
gataactgtt 
cagacaacag 
tctgtttgca 



acctgggctc 
cccctctacc 
gacatggctg 
ctccagtgcc 
ctctcccgaa 
ggtcactcca 
tcctctttcc 



agccagggga 
ctgcccttct 
tgtgtgaaag 
actcttccca 



atgcccctcc 
atcatcattg 
tgaggtccag 
aggtctgggt 
gctaagtggg 
gcactgtgct 
tgtgtgtgtt 
tttgcgtgcg 
gctgtttcct 
aactctcccc 



ggcgactgtc 
gggtgggctg 
attccttggt 
agaagctggc 
ctcattgtcc 
agaactagag 
tttcacacca 



tctgtcaccc 

ggctcaagcg 

caagcccggc 

cttttctgga 

gcccagcctg 

ctagcgattt 

tcaagtgatc 

aagaacctat 

tggtcccagc 

tgcagcgagc 

ctctttaaaa 

gtgctctcca 

tctatccttt 

cttccggcac 

tttggctctg 

cgcccagagg 

aggcctcctg 

agagtggctt 

gccactttgg 

gatgttttcc 

agtagggggc 

agggagtggc 

ttccgcaggc 

ccagccgggt 

tggagcaagg 

cttgggccct 

cttgcctcac 

aatagagcaa 

gtggctcacg 

tcgggagttt 



gccaaaggag 
ggaattcaaa 
aggctggagt 
atcctcctgt 
taattttttt 
actccatctt 
tttttgtttt 
gggaggctga 
aggaggtgaa 
ctcttaaaaa 
tacttaggca 
caagatcgtg 
aaaaaaacca 
tgctgctttc 
acaaaaaagt 
tggcgttcct 
ttccctccca 
cttggctcgg 
agtcagccca 
ttgccgatgg 
gattccttcc 
cagttggccg 

ttgggggagt 
ttctgagcca 
cctgccggcc 
ggaacagccc 
gtagagaggg 
ctgtgggctc 
ttcagttcaa 
gaagagaaaa 
cctgtaaatc 
gtgaccggcc 



cagccgagtc 
cagagtcagc 
ctttgggatc 
gtgctggtca 
agctcggaga 
tcagacacag 
caggagtcga 
tgctcaagct 
gtttcatgac 
tttcagtgtc 
gcagtgctat 
ctcagcctcc 
ttaattttag 
ggcctcccaa 
ttcactgata 
ggtgggagga 
ctatgatcat 
tatatattta 
tctgaggtgg 
tcactacact 
aaaaacatgt 
tgctccagag 
ttgctgacct 
gggtgtccta 
ggagccagga 
ctcaggcatt 
gagggaacct 
gttgggccgg 
tccagtatta 
ggtccaccgg 
gtcctgggtt 
ccataggggt 
cagagccagc 
gtcgcctcct 
aggaagtggc 
ctttttattg 
catgtctgga 
atctctccaa 
ccagtacttt 
tgaccaagat 



ccccagcccc 
gccgtatgcc 
aaatggcgac 
agggtctgtg 
tcatttcgct 
cctgagttgg 
caggagtgac 
tggaggtgac 
cccagggatc 
tgtgtgtgtg 
cgcacatgtg 
tttgggtttt 
tttaaacacg 
agaggaaaca 
ctgcaggacg 
tcccgtcatt 
ggggggcgtg 
ggacagggct 
cctctcgggt 
gagctacggc 
gagtgggttt 
ccatgaaaat 
cataaataat 
ggcatggctc 
tgagtagctg 
taaagacagg 
agtgctggga 
aagttttgcc 
tcgcttaagc 
gtcattgcat 
aaaagtattg 
gaggatggct 
ctagcctggg 
attggaacac 
acccttatgg 
ctgtcctgga 
aatttggccc 
caggcacgtt 
cttggaaata 



aggcctggct 
ctcctctgcc 
tggcagggtc 
cccgctggcc 
atgtgcccag 
atggtgggga 
acctggtccc 
cgctcggttg 
tgtctgtgag 
ggtctccgtg 
ccccaaacct 
atcacttaca 
tggggagagg 
tgtgtgtgtg 
tataagatct 
gtgttcaaca 
tgctgtcaga 
cagggttctt 
gcagcagccc 
ccctcccagc 
cgcctgctct 
ggacccttgg 
ctagtttcat 
ccccgggcca 
tacatttttt 
tatatggaat 
ttcttgagac 
gctgtaccct 
ggactacggg 
gtctttctat 
ttacaggctc 
gggtgtggta 
ccaggagttt 
tccagcctgg 
ggtgtggtgg 
tgagcccagg 
tgacagagcc 
agccatgcct 
cctgaaagct 
aaattcatct 



gggagcgtct 
ccgccatggc 
tggcatctct 
gtgccctcat 
ctgcccccgg 
catcctgctt 
tgcctggagg 
gcccgagttc 
ccggcctggt 
ctgacccctg 
cgtgcccttt 
gcgggagtga 
ctatatccct 
tgtgtgtgtg 
ttttttatta 
gagtggggta 
gggtgggtct 
cataaaaaca 
tgcccctcag 
tggcagccgg 
gctcaccctg 
gtggcctctg 
ttcctgaaaa 
gatccagccc 
aattacttga 
tcaaaaaaaa 



gccccaggtc 
tcaagatgtg 
gagcaactga 
gtgccctggg 
ctggaatcca 
ctctgtggga 
gtcttgcgct 
ctcactttgt 
tgccggccgc 
ctcttcaatg 
agtttggtat 
gagataatag 
gggaggctga 
ggagaaaccc 



ctgttatttc 
ctctgcatct 
tctggctcca 
tgggggaggc 
ctgaaagttg 
gagctgctca 
attctgggat 
ggtggcaagt 
ggctctgccc 
tgccgaggct 
tttggggcca 
tgcccagcac 
aagccaggga 
taaaattaag 
tgacctctga 
ggcgggcaga 
cgtctctact 



agggtctcgc 
tgacctccca 
tgtgtgccac 
gttgcccagg 
gagccacgga 
gtgtgtgcct 
gaggctgggc 
gtgacagagc 
ctcacgcctg 
agtttgaggt 
cagaccctgc 
gttcagtcac 
gaaaatattt 
cccaagttct 
tgaactctgt 
tgtcccctga 
ggaaaggcag 
ctgacccagc 
tcctcagaag 
ttgcaagcct 
ctgggtggaa 
ggtgaggttc 
atccaggaga 
acagccagcc 
cctgggagtg 
ccttgtttgc 
aatggacttc 
aaagtgtgga 
gctgggcgcg 
tcacctgagg 
aaaaataaat 



15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820. 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 
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aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 18480 

agctaaggca ggagaatcgc ttgaacctgg gaggcaaagg ttgcagtgag ccaagatcac 18540 

gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 18600 

taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18 660 

ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 18720 

gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780 

ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840 

tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 18900 

gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960 

cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 19020 

gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080 

gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 1914 0 

accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 19200 

ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 19260 

tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 19320 

ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380 

tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 19440 

catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500 

gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560 

ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620 

tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 19680 

attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 1974 0 

gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800 

ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860 

tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 19920 

tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980 

atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 20040 

atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100 

tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160 

tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220 

ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280 

acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 2034 0 

gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400 

caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 204 60 

aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520 

ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 20580 

tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 20640 

caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 20700 

tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 20760 

cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20820 

gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 20880 

cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940 

aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 21000 

tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060 

actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 21120 

aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21180 

tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 21240 . 

gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 21300 

cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 21360 

agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420 

acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 21480 

ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 21540 

tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 21600 

catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660 

ctggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720 

caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 21780 

tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840 
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acagtgaccc cgtctgcaaa tgtagttcct tgccctactc gcactgggga gcaggacgca 21900 
gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960 
ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatctt 22020 
caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 22080 

gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 22140 

gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 22200 

gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260 

ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320 

agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 22380 

gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 22440 

cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500 

ttagttttcc cagcctcctt taaaaacgcg ttcatgctgg ggccggggca gtgcagtggc 22560 

tcacatctga aatcccacca ctttgggagg ccgaggcggg tggatcatga ggtcaggaga 22620 

tcgagaccat cctggctaac aaggtgaaac cccgtctcta ctaaaaatac aaaaaattag 22680 

ccgggtgcgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 2274 0 

gcgtgaaccc gggaagcgga gcttgcagtg agccgagatt gcgccactgc agtccgcagt 22800 

ccggcctggg cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaagt acaaaaaaaa 22860 

aaaaattagt ctgggtgtgg tatcacgcgc ctataatctc actactcgag aggctgaggc 22920 

ggagaattgc ttgaacccag gaggtagagg ttgtagtgag cccgtatcgt accactgccc 22980 

tccacctggg caatagagcg agactctgtc tcaaaaagaa aaaaaaaaaa agaacattta 23040 

tgccaggtgt ggtggctcat gcctgaaatc ccagaacttt ggaagactga ggcaggagga 23100 

tcacttgagc ccagaaattt gagagtgtct tccctgggca acatagagag acctcatctc 23160 

taccagaaaa aaaaaaatta gcccggcatg gtggcatatc cctgtggtcc cagctactta 23220 

gggggctgac gtggcaggat cacctgagtc tggaggcaga ggttgaagtg agctgagatc 23280 

atgccactgc actccagcct gggtgacaga cagagaccct gtctcaaaaa aaaaaaaaaa 23340 

aaaaagcatt tactatccac catggaaggt gagactgacc tgtgagtgat tgttcaaaga 23400 

acaaaaaata aaccccagag ataagacaaa agggtgcctc catgggggtg tgatttaaag 234 60 

ctgagaaatt gggcttcttc cccctcccct ctcaccccgt ggtttgctaa aggagatggg 23520 

aaaaaggatt ctttttttgg ctgaaatatt taacactaaa ttaaagccaa ttttaacagc 23580 

actttggttg atgagtgaaa ttaacagact ggccaaaaat aaacgaacgg tctgtactat 23640 

gtgaaaaaga ggcagctttg gccatgctgg gccaatgtga gttttcaggg ttgctgggaa 23700 

tgtctgtgaa tcggaggaag ggcctagctg ggactctcag gagccaaggc cctgaggggc 23760 

aacttgcctg gtccctgccc tgaggcgttc actgctttct tcctgggcca gatcacaggc 23820 

ccggaggctg gaccactggg ctggcactct tgccgagctg ctccctgact tcctgaccat 23880 

gctcctttca gcagccttgc tgcactttag tttccttgaa tgaaaaatgg ggatgagaat 23940 

agctcctacc tccaaggtga atggagtgag ttcggacagg tgactccctg ggaccagtgc 24000 

ctggcgcctg acaaggtcca gtcagagccc gcactgctgt tactgatacc cttggctgta 24060 

ccaggggaga acttggttgc cattgccagg tgttctccca ccacccccac tactgtccct 24120 

gtttgatgtg tggcgggaat aaagctgtgc acattggagc ttttggcaca tcctggcttt 24180 
caggtgaaag gtgcgtgtgt gtttgagggt ttagcctggc caacccagcc atgaggtcgg • 24240 

acctgacctg ggggtgagtc ctgagctcgg cacccctgag ctgtgtggct cacggcagca 24300 

ttcattgtgt ggcttggccg cacccctttc cctgctgggc tgttgatgtt tagactggag 24360 

cctctgtgtt cgcttccagg aaccaacccg tgtgcggaca ggaacggggg gtgcagccac 24420 

ctgtgcttct tcacacccca cgcaacccgg tgtggctgcc ccatcggcct ggagctgctg 24480 

agtgacatga agacctgcat cgtgcctgag gccttcttgg tcttcaccag cagagccgcc 24540 

atccacagga tctccctcga gaccaataac aacgacgtgg ccatcccgct cacgggcgtc 24 600 

aaggaggcct cagccctgga ctttgatgtg tccaacaacc acatctactg gacagacgtc 24660 . 

agcctgaagg tagcgtgggc cagaacgtgc acacaggcag cctttatggg aaaaccttgc 24720 

ctctgttcct gcctcaaagg cttcagacac ttttcttaaa gcactatcgt atttattgta 24780 

acgcagttca agctaatcaa atatgagcaa gcctatttaa aaaaaaaaaa gatgattata 24840 

atgagcaagt ccggtagaca cacataaggg cttttgtgaa atgcttgtgt gaatgtgaaa 24 900 

tatttgttgt ccgttgagct tgacttcaga caccccaccc actcccttgt cggtgcccgt 24 960 

ttgctcagca gactctttct tcatttatag tgcaaatgta aacatccagg acaaatacag 25020 

gaagactttt tttttttttt tttgagacag agtcttactc tgttgcccag gctggagtac 25080 

cgtagcgtga gctcagctca ctgcaacctc cgcctcccag gttcaagcga ttcttctgcc 25140 

tcagcctcct gagtagctgg gactacagac atgcaccacc acacccagct aatttttttt 25200 

atatttttag tagagacagg gtttcatcat gttggccagg ctggtcttga actcctgacc 25260 
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tcaggtgatc 
tcccggcata 
tggccttctg 
taatcccatc 
cagcctcacc 
ggcatggcgg 
tgaacctggg 
caacaaaagt 
tttgtttatg 
tatgggctct 
ctctgttcta 
ttctgctttt 
gagtgcagtg 
ctcatctcag 
tttttttttt 
caaactccta 
gccaccatgc 
ctttaagtca 
agtaaaaatt 
tgcctttaag 
agaattctgt 
caaactcttt 
ttaggcttaa 
gcctgtaatc 
gagaccggcc 
gggcgtagtg 
ttgagcccca 
agcgacagag 
tatatataca 
atataatata 
gtaaatatat 
tataaaaata 
atatgtaatg 
tattatttta 
gctgcatttc 
cagtagctgc 
gcctggcgtg 
caactgaggt 
agaatacaaa 
ggctaaggct 
accattgcac 
atgctggttc 
ccttcatgaa 
gcatggccgt 
tcgaagtggc 
acccgaggtc 
cttgtgttca 
ccgtgtggcc 
ctggtagggc 
ggcacagtgc 
agggtgggta 
t'tccctaagg 
cctgaccagg 
ggtccccgga 
ggacccacct 

gggcacgggg 
tttctgtgtg 



tgcccgcctc 
ggaaaacttt 
attctgtaaa 
actttgggag 
aacgtggaga 
cgcctgcctg 
aggcggaggt 
gaaactccgt 
gattgtctgt 
aaagcctaaa 
aagtattgga 
tgttgttgtt 
gtgtaatctc 
cctccctagt 
tttttttttt 
ggctcaaaca 
ccagcctatt 
caagggttct 
gtccataacc 
tcttttttct 
atcctgattt 
ggcacaatct 
ctgttctttt 
acaacacttt 
tgggcaacat 
gtgcatagct 
ggaggttgat 
caagacttgt 
tatatacata 
taatataaaa 
aatatataat 
tatatttata 
tatatttttt 
tataaaatat 
aaacattcat 
tgccagtggc 
gtggctcacg 
ctggatttcg 
aaaattagcc 
ggagaatcgc 
tccaggctgg 
ctaaaatgtg 
cgggagctcg 
tgactggatg 
gcggctggac 
gctggccctg 
cctcgtatga 
ctcagtgatt 
cctcagaggt 
tggctgcttt 
tctgtgtcga 
tcacataccc 
tctccttgct 
cctcaggctc 
cgagaggcaa 
ttcagacagg 
gtggacacag 



ggcctcccaa 
ttgccttcta 
gaaagtttga 
gccgacgtgg 
aaccccgtct 
taatcgcagc 
tgtggtgagc 
ctcagaaaaa 
ggcagctttt 
acatgtgcca 
caactacaat 
gttgttgttg 
agctcactgc 
agctgggact 
ttgtggagac 
cccacctcag 
ctactgtttg 
ttgtagaaaa 
tctccagcca 
ctaagatagc 
tttcacttga 
tttaaatgta 
attttgcgtg 
gagagggtga 
agtgagaccc 
gtagtcccag 
gctgcagtga 
ctggggaaaa 
cacgcacaca 
atatatattt 
ataaaaatat 
aataatatat 
aatgtatgat 
atataaaatc 
ttaaaaactt 
caagcatctc 
cctgtaatct 
agaccagcct 
aggcgtggtg 
ttgaacccag 
gcaagaagag 
gcccttttcc 
gtggagcacg 
ggcaagaacc 
gggcagttcc 
gatcccacca 
gacagtgcgg 
agagctgtac 
catggagttc 
cacctgggcc 
ttttacagat 
agttagagca 
ttgcactctt 
ctccgaaatg 
atgtgctttg 
ccccagatgt 
tgctcacccc 



agtgctgaga 
aagaagagtt 
ttggtggctg 
gcatatcacc 
ctactaaaaa 
tactcaggag 
tgagatggca 
aaaaagtttg 
gttctgccga 
tccgcccctt 
gtttgctcat 
agatagggtt 
agcctcgacc 
acaggcacac 
agggtttccg 
cctcccaaag 
tattacatag 
aaatatatat 
gagacgaccg 
atttctcttc 
cattacaaca 
tgaaatactc 
tgctggttac 
ggcaggagga 
catctctaca 
ccaccaagga 
cctgagatta 
aaaaaaaaaa 
cacataatat 
ataaataaaa 
atattatata 
aatacatact 
atataatata 
tccaagttgc 
gaatgctggt 
gggagatgtc 
cagcactttg 
tgccagcttg 
gcatgtgcct 
ggggcagagg 
cgaaactccg 
tcctcacctg 
tggtggagtt 
tctactgggc 
ggcaagtcct 
aggggtaagt 
gggtgccaac 
tgatgtcatt 
cttcgtggag 
gtctcaccga 
gcagaaacag 
gagctgggcc 
gccaaaacca 
gcctcttgga 
agctgccagg 
gtggggcgtc 
ttaaagcacc 



taacaggtgt 
tagcaaacta 
ggtgcggtgg 
tgatgtcggg 
tacaaaaaaa 
gctgaagcag 
ccattgcact 
attggtgtaa 
gatgagttgt 
tacagaaaaa 
ttattattct 
tccctctgtc 
tcctgggctc 
acc'accactc 
catgttgccc 
tgctgggatt 
ctttaaaaga 
ataggaaagt 
ttgctgacac 
atcacagtca 
ggtatttgat 
cactgcacag 
agccgggcac 
tcacttgagc 
aaaaactttt 
ggctgagttg 
ctccactgta 
aatatatata 
aaaaatatat 
tttataaatt 
atatataata 
tataagtata 
catttataaa 
tttttccaaa 
gatctggtcc 
tacaaaacac 
ggaggctgag 
gtgaaacccc 
gtaatcccac 
ttgcagtgag 
tctcaaaaaa 
ctgccagacc 
tggccttgac 
cgacactggg 
cgtgtggagg 
gtttgcctgt 
tgggcaaggt 
agccttgatg 
cgggtgctga 
agtgtccatg 
gctcagagaa 
aggaagtgct 
tgatccagaa 
ggctgctgag 
cgtcctgggg 
tttctggact 
tgtgatgtgt 



gagccaccgt 
gtctgtgggc 
ctcacacctg 
acttcgagac 
aaattaaccg 
gagaattgct 
ccagcctggg 
ccaaagcgca 
gacagatctg 
gtgtgctgac 
atgatttgtt 
actcaggctg 
tagtgatcct 
ctggctgatt 
aggctggttt 
acaggcgtga 
ttttttatga 
ataaaaagaa 
ctcagcatat 
tatgctacgc 
ggcgctgtga 
atgtttgctt 
agtggctcat 
ccagaagttt 
ttaataagtc 
ggaggattgc 
ctccaacctg 
tatatatata 
atttataaat 
atatttataa 
aaatatataa 
tatttaaaat 
tacacattta 
aaggtgtctt 
agaatgtgtt 
gctggttctg 
gcaggtggat 
atctctacta 
ctacttggga 
ccgagatcgc 
aaaaaaaaag 
atcagccgcg 
taccccgagg 
accaacagaa 
gacttggaca 
cccgtgcgtc 
ggcaggctgt 
gtggccagga 
ggctgtatca 
gagcctgcgt 
accgagtgac 
gtctcaggct 
ctgactttga 
ccacagctta 
gccctgcctt 
tgagttttct 
gcagcagccc 



25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 
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26100 

26160 

26220 
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2.6340 

26400 

26460 

26520 

26580 

26640 
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26760 
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26940 

27000 

27060 

27120 
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27300 

27360 

27420 

27480 
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27600 

27660 
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27960 
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aatccctgcc tgtcgcctgt tctgctaggg aaggaaggaa gacttcagga tggcaggaca 28740 

acagaaagag gtccaggttt tagagcaagg gcaggtcaaa cttagaaaat tctggaatga 28800 

ggatgtgcat ttcctcttct ggatctgcta aaagaagagg gaaggagggg ctgctggggg 28860 

aggagcccag agccgagttt acatccggat cccgcaaggc ctcccctgcc ctgaggtctt 28920 

gttttgtgat gtgcttgtgt ccatcctggt ttctgccgtg tccccaacat ccggccaagc 28980 

ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 29040 

tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 29100 

ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgcttcct 29160 

gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 29220 

ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 29280 

gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 29340 

catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 29400 

cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 294 60 

gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 29520 

gggcgttggc cacctcccag cctcgccgca cgtaccctgt ggcctgcaag ttccccaacc 29580 

tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640 

tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700 

gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 297 60 

atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 

acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880 

ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940 

acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 

tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060 

ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120 

gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180 

agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240 

tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300 

cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360 

tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420 

cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 304 80 

agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540 

gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 

ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660 

gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720 

cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 

ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840 

accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900 

aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960 

catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcaccgca cagctgtggt 31020 

tgcctgaaat gattaagctc attaatcacc ccggagtgag gacagactca gatgaaaacc 31080 

agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 31140 

ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 31200 

ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 31260 

gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 31320 

gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380 

cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 31440 

gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 31500. 

gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 31560 

ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 31620 

cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 31680 

tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 31740 

atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800 

ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 31860 

gcagccgtaa gtgcctcatg gtcccccgca cctcactc'cc tcgttagatc aggctggttc 31920 

tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 31980 

ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 32040 

tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 32100 
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tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160 

agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 32220 

gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280 

ccaaggccat acaactgggg tgggttgagc cggggttgac tgtggcaggc tggctcaaga 32340 

gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 32400 

acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 324 60 

tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32520 

agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580 

aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32640 

gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32700 

ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760 

tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32820 

acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32880 

ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32940 

ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct gttgagaggg 33000 

agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 33060 

gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 33120 

tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 33180 

cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240 

tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300 

taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360 

aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420 

gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 334 80 

cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540 

gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33600 

aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 33660 

tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat agcgcacccc 33720 

caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 33780 

ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 3384 0 

ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900 

aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct catcctgccc 33960 

ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020 

tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34080 

ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140 

tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 34200 

gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg agactgtgat 34260 

tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 34320 

ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 34380 

gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440 

gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500 

agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560 

gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34 620 

atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34 680 

ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740 

ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800 

tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 348 60 

gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34 920. 

tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34 980 

ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040 

tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100 

tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160 

cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220 

ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 35280 

gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340 

tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400 

aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 354 60 

acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 35520 
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tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580 

cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 35640 

aggacaggcc tggctcctaa gcagccttct tttacaaaaa ctgcagagcc cgcctgtatc 35700 

ctagcacttt gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaccagcctg 357 60 

gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 35820 

cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880 

gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940 

ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 36000 

gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 36060 

gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120 

cccatctgaa ggctggtgcc gtccagacag gctggacgcc cctttccacc agatccttcc 36180 

tcccgcagca gtttctagtt acgttgtact gtgaggtctg tgtccttggt tgatggcaaa 36240 

agtcagccga attgaaattc agagccatgc ctggctccct ggagcttctc tcctgggcag 36300 

ctgtgatcat tgcctctgct gtggtgtggg tggtggaaat ggattccttt catcttgctt 36360 

gctacaggtg actgtcacgt ggagtccttt ggagagaggg acgtgttaat tgatggatgt 36420 

ggctcccatg ctgagaaagc tcctgggcgt acattgcctt agagtttcat tggagctgcg 36480 

ttcttttatg gtgtctgcta ggcagaagtg atgaagactt ggaagaaaac ccagaaggtt 36540 

ttccacttaa tttggaaaat gtgcttttcc cctcctgtgt cttttgctaa ggtccagcct 36600 

cctgcagcct ccccgctctg tggactctgg ctttgattct ttattaggag tccccctgct 36660 

cccccaaaag atggtgtcta aattatcatc caattggccg aggttttgtt ttctattaat 36720 

tgtttttatt ttttattgtg gtaaatttat ataacataaa atttgccatt ttaattgttt 36780 

tgttattgtt gtttttgaga cagggtctca ccccagtgcc caggctggag tgcagtggtg 36840 

cgatcatggc tcactgcagc ctcagcctcc agggctccag tgatcctctc acctcagcct 36900 

ctctagtagc cgggactaca ggcatacact accacatctg gctgattttt tgtatttttt 36960 

ttttattgta gagacccgct atgttgccca ggctggtctc aactcctgga ctcaagccat 37020 

cctcccacct caccctccca aagtgctggg attacaggca tgagccacaa cacccagcca 37080 

ttttaatttt tttttttttt tttgagatgg agtctcactc tatcgcccag gctggagtgc 37140 

agtggcgtgg tatcaactca ctgcaacctc tgcctcccag gttcaagcga ctctcctgcc 37200 

tcagcctcct cccgagtagc tgggattaca ggtgcccatc actatgcctg gctaattttt 37260 

gtatttttta gcagagacgg ggtttcacca tgttggccag gctggtcttg aactcctaac 37320 

ctggtgatcc gcccgcctcg gcctcccaaa atgctgagat tacaggtgtg agccaccgtg 37380 

cccggccttt ttttgttttt gagacagggt cttgccctgt cacccagact ggagtgcaat 374 40 

ggtgggctct tggctcactg cagcctccgc ctcccaggct caagttgtgc acctccacac 37500 

ctggctaact gtattttatg tagagacaga tttcaccatg ttgcccaggc tgggcttgaa 37560 

atggactcaa gcagtccacc cacctcagcc tcccaaagtg ctgagattac aggcgcgagc 37620 

caccgcaccc agcccatttt acctattctg cagttgacag ttcagtggca ttcagtcagt 37 680 

tcacgaggta accatcactg ccattcatct ccagactact tcaccttctc ggcagatgtc 37740 

cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 37800 

tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 37860 

tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 37920 

gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 37980 

ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 38040 

aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 38100 

ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 38160 

aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 38220 

gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 38280 

acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 38340. 

tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 38400 

aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 384 60 

atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 38520 

ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 38580 

gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 38640 

cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 38700 

aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 38760 

agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 38820 

ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 38880 

aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 38940 
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actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 39000 

cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 39060 

tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 39120 

gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 39180 

tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 39240 

tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 39300 

ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 39360 

gcgcgtgtgt gtggtgtgtg gtgtgtgtgt gtgtatgggg gaggcaccct ttccatctgg 39420 

gtccaagaga ctgggcctgg ggaagacgct tctttttatc tacttagaga ctttgtttta 39480 

tttgtatttt tttgagacag ggtctcactc tgtcacccag gctggggtat ggtgatatga 3954 0 

gcatagctca ctgcagcctc ggcctcccag gctgaagcga tcctcccacc tcagccttct 39600 

gaatagctgg gactgtaggc gtgcgtcacc atactgagct attgtttttt ttgtttggtt 39660 

ggtttaattt tttttgatac agatggagtc ttgctatgtt gcccagacta gtctcaaact 39720 

cctgaactca agtgattctc ccacctcagt ttcccgacat tctgggatca caggtgtgag 39780 

ccactgctgt ctccctgttt tattaactgc tgaaagacct agataaagaa agtctgaaaa 39840 

gacttactat cagagcacca tcctaagatg attccctctg actcaatgga gagggagggg 39900 

agcttttcct tcaggcctgg gtggcaggag cccaggtgct ccaggcccca tttgccccag 39960 

gccaaatcac tcgggaactt ggatgcagct gtctttcagg gtaacccaaa ggaaccagat 40020 

ccccgcaggc agtaggcttc tgggctgtcc tctcctccta cgtcagctca gtaagagccc 40080 

ttcgaaggga tgctgtgtcg gaggccccaa aagcccaggc tcatccctga gatgcacagg 40140 

gtgggctggg cttaggcagc gctcgagcat ctcctggacg gtgaccccag agagtgtgga 40200 

gacggagagt ccttgagagt cactgagaga cgtggctgcc ctgccttccc aagaggggct 40260 

ctgagtcatt ccccacactc acctgcccct acccaccctc acctggcccc cagcctcacc 40320 

tacccccaca tctgtaccga tccctttacc cgcaccttcc ctacccaccc tcacctcccc 40380 

tgtaccttca cctcccccac tcacccgccc ctgcaccctc acctgtcccc caccttcacc 40440 

taacccccac cctcacctgc cctcccctca cctggcctcc ttccgttggg gaaggggttg 40500 

taaggggcgg cccccaaact gtctgtcctg gtgccctgca gagaaaacag tacgtgaggg 40560 

ccgcagtcca aaagcttgag tcctggaagg tggaggagac agggatgtgt tgggaagggc 40620 

cccatggtct tggatccctt ctcgactgtc aatggggcct tcatgggagc gccagtctag 40680 

tgatgcacag ctgggtgccc ggcgggtggc tgaggaggcc taaagtccga ggcggcaaga 40740 

gctcttccag aggctgttgt cctaatcgct ctggcatact caggcgggca cgtagttagg 40800 

agctgattgg agaggagaga cccccacacc aatactggga tttgactttc aggctaaact 40860 

tgagaagtgt ggcctctgct gtcctgccag agctctccag ccagtgccca gggctctcca 40920 

gccagtgccc gggggtctcc accagtgccc gggggtctcc gccagtgcca ggggtctccg 40980 

ccagtgccca ggggtctccg ccagtgctca ggagtcttgg tttctttgtc ttacagccct 41040 

ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac gacctcagca 41100 

tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc atcaacgtcc 41160 • 

acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac aagcccaggg 41220 

ccatcgtcgt caacgcggag cgagggtagg aggccaacgg gtgggtgggg gtgctgcccg 41280 

tccaggcgtg cccgccgtgt cttatgccga atgccagcct ctcacaggct ggggagactt 41340 

tccacctggg gatccaatgg gtggctttcc agggtcccaa aagcaaacac aggtttttca 41400 

cagcccgtcc gggaaagcag aaagccccaa ggggctggaa ggggaaaggg ggagctctgc 414 60 

tgagaggtta caaggcagcg ctggccgacg ggagttgcag ttgataggtt ttgtatcatc 41520 

cttgttaaac ttgaaccctg tgcagaaatc ccttccacgg catgggggct gcctgttgac 41580 

tcgctcctgt tccaccacag ggagctcctg ggcttcttcc tcccagaggc ccccgacgct 41640 

cccacctgtt ggtcgtcaga gcttctggtt ggtgggaagg cacccaggac cttgaggtct 41700 

ccagagagaa aagccaggga aagagggaga ccgaaaccca tgtgacatga aactcaggct 41760. 

ccaaactgag cacgggaacg tttggggaca ggagcgcgat ggccttcctc agatagctgg 41820 

ggggctggca tgaagacggg agctacagcc agcacaggtc ctgggccggg agcccagaga 41880 

ttgagccctg actctgtcac ttactggcca cgtgaccttg ggcgggtggc atagcctctt 41940 

ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 42000 

acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 4 2060 

agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 42120 

gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 42180 

tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 42240 

ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 42300 

gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 42360 
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ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 42420 

ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 42480 

tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgt cagcctcggg 42540 

cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 42600 

cgaggagacg ccatccccac acccgtcctt cacccgccac cctcccgcag gtacctgtac 42660 

ttcaccaaca tgcaggaccg ggcagccaag atcgaacgcg cagccctgga cggcaccgag 42720 

cgcgaggtcc tcttcaccac cggcctcatc cgccctgtgg ccctggtggt agacaacaca 42780 

ctgggcaagc tgttctgggt ggacgcggac ctgaagcgca ttgagagctg tgacctgtca 42840 

ggtacgcgcc ccggggcctg ccctaaccgc agacacccgg ccttcattgt cagtaatggc 42900 

agcagctgcc acattgtccg agacctgccg tgagcccagt gccgcgccag gggctttgtg 42960 

tgtagcgtgt tttgtcctca cactgacagc tgtaggctgg ggttctgagt gagccccaca 43020 

gggcagaggc agaaaatgag tctcagagag ggtgagcgag ctgcttgggg ccccacagca 43080 

ggagatggag caggactgca gcctagcctc tgcccccagc acctgcgcaa gaagctgctc 43140 

tgctctggac tgtgttaggc tgcgagggct ggagagaaat gagagttggt gcttagagag 43200 

ggggcgcagg tccccatggc ttttcctctt atgatgaggt agatgggtga agggaggggc 43260 

catgcttgca ggggccagtg accgaggccc gccgttggaa ctgatggcct tcatcccgag 43320 

cccagcccag gtgggagcag ggctttccga gggcttgtct tgggtcggcc tgcttccagg 43380 

gactctgctg cagctcccac ccctgtccaa agcatggaat cccccaggct ccctggcagt 434 40 

cctgtcaacc tctgtcctcc caagctgagt gtggggcaag ttctggaggt cagcactgct 43500 

caggggggcc cacgggctgc ttgcaggggc caaccgcctg accctggagg acgccaacat 43560 

cgtgcagcct ctgggcctga ccatccttgg caagcatctc tactggatcg accgccagca 43620 

gcagatgatc gagcgtgtgg agaagaccac cggggacaag cggactcgca tccagggccg 43680 

tgtcgcccac ctcactggca tccatgcagt ggaggaagtc agcctggagg agttctgtac 43740 

gtgggggctg gcagtggggt gggcagggtg gcctctaaac ccgacccctg gaggaggctg 43800 

gaggccagtg caagatcctg tgtggcctca gccaggcggt ggtctctgcc agatgccaac 438 60 

tgttgcccgc tggggttcag cgacatgtcc gaatgtcccg aggcctctga ggttgttttc 43920 

ttttgccgca gaacaaatca ccacgaacag cgttttaaga caacaccaac tctttttttt 43980 

tttttttttt tgagtcagga tcttgctctg ttgcccaggc tggggtgccc tggtgcaaac 44040 

acagttcact gcagcctcga cctctgggct taattaagtg aacaccttgc ctcagcctcc 44100 

caggtagctg ggactacagg tgggcaccac cacacctggc taattttttt ttgtagagac 4 4160 

ggggtttccc catgttgccc aggctggtct gcaactcctg ggcacaagct atctgcctgc 4 4220 

tgtggcctcc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 44280 

attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 44340 

ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 44400 

ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 444 60 

gttttctctc tggttattgg tcggggacca ctctcagctc ctagaggcca ccacaggtcc 44 520 

ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 44580 

cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 44 640 

taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 44700 

gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 44760 

acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 44820 

ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 44880 

ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940 

ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000 

ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060 

tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 45120 

ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45180. 

attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 45240 

gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 45300 

gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 45360 

gtcatttctt ccacctgggt gtggtcagct ctccctccaa gctgcctcct cttcttctca 45420 

ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 454 80 

tggtccattt gctattttat tttattttat tttattttat tttatttatt tatttattta 45540 

tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 45600 

gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt 45660 

ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 45720 

gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 45780 
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ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 45840 

catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 45900 

caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 45960 

ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 4 6020 

gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 4 6080 

ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 4 6140 

ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 4 6200 

gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 46260 

tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 4 6320 

tatacaatga aagacgggaa gagggccagg tgtggtagtt cacgcctgta atctcagcac 46380 

tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 4 6440 

atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 4 6500 

tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 4 6560 

caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 4 6620 

aaattatggc cgggcgcagt ggctcccgcc tgtaatccca gcactttggg aggccgaggt 4 6680 

gggtgaatta caaggtcaag agatcgagac catcttggct aacacggtga aaccccatca 46740 

agatcacaag gtcaagagat ggagaccatc ctggctaaca cggtgaaacc ccgtctctac 4 6800 

taaaaataca aaaaattagc cgggcatggt agcgggcgcc tgtagtccca gctgctcggg 4 6860 

aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcggtga gccgagatcg 46920 

ctccatgcca ctgcactcca gcctgggtga cagagtgaga ctccgtctca aaaaaaaaaa 4 6980 

aaaaaaaaaa aaaaaaagaa aattagccag gcacagtggc aggtgcctat tgtcccagct 47040 

acttgggagg ctaaggcagg agaatggcat gaacccggga ggtggagttt gcagtgagcc 47100 

gagatcatgc cactgcgctc cagcctgggc gatagagcaa gactctgtct caaaaaaaaa 47160 

agccaggcat ggtggtgcat gcctgtagtc ccagctactc aagaggctga ggcaggaggg 47220 

ttgttcgacc cacggagatc aaggctacag tgagccatga tcgcaccact gccctccagc 47280 

ctgggtgaca gagtgtgacc ctgtctcaaa gtaagtaaat aggaggagag acaagtgggc 47340 

agttcagact gatggtatgg gcacagtaga gactggtgca gacaggctgg cctgtgatgt 47400 

caagcaactt ctgtaattgt ttccggcatc catttgtgtg tcaatttccg tgtcagtagg 474 60 

aagactctgt aggctgccaa gaggaataag tgggaggatc ctcccagaga ggccgggcct 47520 

gcaggagggc cagttctcat gagttctcat ttggccccta ccctccaggc tgtggttctg 47580 

aggtgggaga cagagcctga cctctgtttg tcttgttttg tctttgcagc agcccaccca. 47640 

tgtgcccgtg acaatggtgg ctgctcccac atctgtattg ccaagggtga tgggacacca 47700 

cggtgctcat gcccagtcca cctcgtgctc ctgcagaacc tgctgacctg tggaggtagg 47760 

tgtgacctag gtgctccttt ggggtgatgg acaggtacct gattctctgc ctgctaggct 47820 

gctgcctggc atccttttaa aatcacagtc cctgtggcat ccagtttcca aagctgattg 47880 

tgtcttcctt tgccctcctt tcttttctac tatgtgcatt cggtgctatg aattttcctc 47940 

taagtactgc gtttcctgca tctcacaaat tttgttacat tttcattttc aggtagtttg 48000 

aatattttta cacttctcct gagatgacat ctttggctca tgtgttattt agaagtgttg 48060 

cttagtttct aaagagttgg ggcttttcca gctgtctctc tgcaactgat ttctaattta 48120 

attctactgt agtctgagag cttattttat atgatttctg ttattttaaa tgtgttgggt 48180 

gtggtgtttt tgttgttatt gtttttgtgt ctttttgttt tgttttgctt cgtttgtttt 48240 

gtttttgaga cagtgtcttg ctctgtcact caggctggag tgcaatggcg cgatctcagc 48300 

tcaccgcaac ctctgcctcc cgggttcaag tgatcctctt gcctcagcct cctgagtagc 48360 

tgggattaca ggtgcacgcc accataccca gctaattttt gtatttttag tagagacggg 48420 

gtttcaccat gttggtcagg ctggtctcga actcctgacc tcgtgatccg cccacctcgg 48480 

cctcccaaag tgctgggatt ataggcgtga gccactgtgc ctggccatta ggtgtgtttt 48540 

atcacccagc atcatgcagt ttatcttggt gaatgttctg tgtactcttg aaaagaatgt 48600- 

ggattctgct gttgttgggt ggagtgttcc agaaacatca attagatcca gttggttaat 48660 

agtgctcatc aggttgtctc tatccttcct tcctgactgc ctgcttgagc tgtcagttat 48720 

tgacaggggt gtggagtctc caactctaat ggtggatttg tttatttctc ctagtagttc 48780 

tatctttttc tctccttcta cccttgatcc tcttctcccc ctagggcttc ctggtgttag 48840 

tggtgggaga gtggggtagt gaagaacctg gactttaggg ccaaagaggc cagggttcaa 48900 

atcctggctc tgtcacttcc cagttgagtg accctggctg gtgcctgaat ctctgtgagc 4 8 960 

ctccacttcc tcctctgtga aattgagagc acttacctgg caggctgtca tgggcatcaa 4 9020 

gtaacagggc actccacctg gaccctgaca cgtgatgcac aggaatgcca gctgctatgc 4 9080 

catgggtgtg gcagtagtaa taaagtgacc atctgtatcc tcaccacagt gaagcctgtc 49140 

cagggctttc tctcctatgc ccccatgcct ccaggtggcc ttggatcctg ttggttctgt 49200 
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gctctgctca gcgacctttc tcccgtggga gttcctgggg gttcagcttc atcctacaga 49260 

cagcagcaca cactggctgt gcaccctttt tttttttttt tttttttttt tgagatggag 4 9320 

tctcgctttt ttcgcgcagg ctgaagtgca gtggtgtgat cttggctcac tgcaacctct 4 9380 

acctcctggg ttcaagtgat tttcctgcct caccctccca agtagctggg attacaggct 49440 

cccaccacca cgcccggcta atttttgtat tttcagtaga gatggtgttt caccatgttg 4 9500 

gccaggatgg tcttgaactc ctgacctcag gtgatccgcc cacctcagcc tcccaaagtg 4 9560 

cagggattac aggcgtgagc caccacaccc ggagtgccgg ttgtttttag cagtttgtct 4 9620 

tgttcctgga gagactggct cctgcccagg agctcgggga gtagggccgc ggggtgctgc 4 9680 

ctcacacctc gagtttggcc gtaagcagag gggacatttt gtgactgtcc ccctcctgag 4 9740 

cttcccagca gcttttctcc aagttacagc ccaaaagctc aggtggattt gcaacccaac 4 9800 

ggtgtctgtg cacctcccac tgatgcccga actgccctgg ccaagaaacg gggccgtcag 4 9860 

aacgctgcac taactgcagc cttgggcctc catgccagag gccatgccct tccatccacc 4 9920 

accccctggc ctgggccctg ggccctcctg gctcgggaac tccaggcccc ttcctcacgg 4 9980 

ctcgagagac gtgtatttac cgcacaggtg cttgtcattc tcttgtggcc tcttctccag 50040 

ggagatcaca gaaggacagg gcctcactga ggtctcggac atggaccctt tgatagtggc 50100 

aggagccagg ctgggcaaga ggcggccaca gtcacctcag cagtgccatc accaccgcca 50160 

ttcagccctt ccctgagccg ggcgcgcccc tggctctggc cccagtgtcc cagttacagc 50220 

tcacaggagc ttgtggtgcc cagcggctgc ttctgattga gagtcgaggt cggaggcttt 50280 

gggaggctga gaggctgctc ggtttcacaa ctgctgaggg agacttgggc tccatctcag 50340 

gtatgcccca tgtcgccctc aacctccagc caccggtcct ccgtgtcccc catggccagg 504 00 

cacggcttgc agacatctgt cgttggctcc tctcagccgt cgtgggctga ccctggcacg 504 60 

tcctcctgtg gctgagccca gtggggacag ctgcttcctt ttattaccct agaactctcg 50520 

tctttgatca ggccccctcc cctatgccac acagtccctg tcactcgggt gagcccagta 50580 

gtcatgggga aggcctgcgg gttccaaaca tccaaaggct tgcgtgcagc atgacagctt 5064 0 

gaaaccgatg ttttttacct tgatcagatt tcagcttggc gggggctttg ctcagctttc 50700 

agtgaggcct gggccgattt cccagcatcc cctcctgagg ccagcctctg tttcctgtga 507 60 

ttttctgcac aaagtgggag ggaggagtcc taggaaatgg ggggccacct cgaagcctag 50820 

gcctcctctg gcttctctgt gccagtgccc ccacgctttg tgtctgtgtc cccagcccat 50880 

gggactctgc tattccctga gtgctgccgc atgcccagcc cgcactgagg acgtggagcc 50940 

ccgaggggca ggatggcctc catggtcaca cgtaggaagt ggcctccacc ctccgatgat 51000 

cctctccctc ctccctttca gcgccctccc cgggggtgtc ctcagccctc ctgcctgtgc 51060 

tttgtcccgt cttctgcagg cgcctgggac gtgctgacag gtcctctgcc ggctcctgcc 51120 

ttgctatgcg cacgctggtc accacagagg cctggccctt cttctgtagc agtcccacac 51180 

ccgcaacagg tgtggctgct gaccacctgc tttctgcccc tctggtcctg aggagggcgc 51240 

agtgggcact caggcgtggc tgagcagatg tgtgttgccg ggaggaggaa ggactgctcc 51300 

agtcagggct gaatttccca cccggagcat ttctgctgta tttggtgtag cgcctgctgc 51360 

ttaaagctct gattcccagt tggcaccctt tcccttctgc attgaaaaac atacggatgc 51420 

atgtcttctt gcagtgaatg tgtattctcc cagcctctct tctgggttgg ggctggaggt 514 80 

ggagcggcac acaggagccg cagcgatgga ggatgtgcgg gtgcagcacc ccgtacagca 51540 

gggatgccaa acccgcgctg agtccctctc aacttctgct ttgaagccca gtcacgccat 51600 

tgcctgggtt ttgctgggcg gggctgcgtg tgatgttctc ctctgtccct cccccagagc 51660 

cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 51720 

gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 51780 

ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 51840 

gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 51900 

ccctccccgt caaggctctg ccaagaccct ggccctgccc tccgggatac gagcttgggg 51960 

ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 52020- 

agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 52080 

gtagtgtggc ttgaaggcag actgtgaggg tctatctcac cttcctgcat taggtcagga 52140 

gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctcacaccc 52200 

tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 522 60 

taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 52320 

catcctgggc cctctctgac cagcaggatc ttcctttaga tcttgacagt gaaacacatc 52380 

tcttctgtgc cccctgtgag ttttctttca ttcattcatt cattcattca ttcattcatt 52440 

cattcattcg agacagagtc ttgctctgtc acccaggctg gagtgccctg gtgtaatctc 52500 

ggctcactgc aacctctgcc tccagggttc aatcgattct cctgcctcag cctcccgagt 52560 

agctgggatg acaggtgcgc accaccatgc ctggctaatt tttgtatttt tagtagagac 52620 
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agggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg atccgcccgc 52680 

ctcagcctcc caaagtgctg ggattacagg catgagccac cgcgcccggc ctgagttttc 52740 

cttttatgaa ggacctgctt ggttggttgc ctgccacatg ttgtcagcac catgggccca 52800 

ggactgctga ggagctgttg atgccctcgc tctcccagag ccaccggctc tgttagataa 52860 

ttcacatgca gtctggccac tgtcctacgt cctcattcac aaagagcaga catttcgtag 52920 

aagatgaggg cctgggagta acctccctgc atgtttttct ataaaggcat agtggttaag 52980 

tccttccagc tcattgacca ttggagaatt ttatggaggc tgtagactag gggctggtaa 53040 

actaagggcc caggggccaa atccagcctg ccacctactt ttgtaaataa agttttcttg 53100 

gtgcacagcc atgcccattc attcatttgc acaatgtctg tggctgcttt catgccaaaa 53160 

gcaagagaac tgagtggtta tgctggagac ctacggcctt caaagcccca gacctcacgt 53220 

ctggcccttg acagacagag cttccccagc cctgctgcgc atcctggccc agcatgtgct 53280 

gtgtgtgtga tttcagcttg caggagccgt ggttaggaat tgtccctgtg ttggtccatt 53340 

ttgcattgct atgaaggagc acctgaggcc gggtagatta tgaaggaaag aggtctgtct 53400 

ggctcatggt tctgtaggca gcaccagtat ggcacccgca tctgctcagc ttctagtgag 534 60 

gtctcaggaa gctttgactc atggtgaaag tcgaagcggg agcaggtgca tcacatggtg 53520 

agagagggag caacggagag agagagagag cgcctctccc tcttgccctc accttgagag 53580 

gagatgccag gctcctttaa gtaaccagct cccatgtgaa ctcacagtga gagcccattt 53640 

gctactgcgg agagggcacc aggcatctgc tcccatgacc caaacactgc ccaccaggcc 53700 

ctacctccaa ccttggggtc atattttatt ctgttctatg ctatgctatg ctatgccatg 53760 

ccatgccatg ccatgctatt cctattctat tatttgagac agaatctcgc tctgttgccc 53820 

aggctggagt gcagtggcat' gatcttggct cactgcaacc tccacctccc aggttcaagc 53880 

gattctcctg tatcagcctc ccgagtagct gggattacag gcacacacca ccacacccgg 53940 

ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 54 000 

ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 54 060 

agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 54120 

ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 54180 

gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 54240 

gttaccagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 54 300 

ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 54360 

gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 54420 

cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 54480 

cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 54 540 

gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 54 600 

acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 54 660 

tagacctccc cacagtaggc acctgacaca tacctcctgg ggggcaggca ggaggtgcgt 54720 

tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 54780 

gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 54840 

tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 54 900 

caccctccag cctctctgag tgcatggcct ctccttgcag ccatctgcct gcccaaccag 54 960 

ttccggtgtg cgagcggcca gtgtgtcctc atcaaacagc agtgcgactc cttccccgac 55020 

tgtatcgacg gctccgacga gctcatgtgt ggtgagccag cttctggcac ggggaagggg 55080 

cgtccgggct gggttccccc aggaacgtgg agtttagggg aggagacgtg cctttccagc 55140 

ggggctgggg gctgtgtggg agactcaggc ggctgggagg ctccttgcgg gaggcaggga 55200 

agcctttccc agggcagcgg ccaggaggac agactgtgag ctgtgggctc ggcggctaca 55260 

gagtctgcct cagtgggcgg ggctgatggt gtccaggtgc ctgcagcacg cacccaccca 55320 

cgggaccttg ctgagcagcg tctgtcaggc agcaagatta cccgagggct gcagtggtcc 55380 

tgttccctgg cagcttactg tctggctgag gaggagtgat gttcacatat gcacacatgt 55440- 

catgtgcaca cacatgtaca tgacaacatc ccacatgctc ctcaaatagc atgacctgta 55500 

cagtcacgga tatagggcct aggggatagg aggccaagac agtcagggaa gactttccag 55560 

aggcagtggc tcctgaaagg ctgtctgatt caggcaggaa gggagctgag ttcagatagg 55620 

aagtagcaat gagtcattgt gtctggggac atggccactc cttcgctgca gagggacctg 55680 

ggctgagagc tcctctctta tggctgcagt cgggagagaa gtctgttggg gggagaaggg 55740 

ggcttcctca agggactccc tgtgcccttt ggcaccttcg tgccaggtca ggcttgaggc 55800 

ctgaaggcag tggtgggggc caccaagggt cgcctcctct gctgggcaag ttcccagtct 55860 

gacgggcctg tgccgtgggc cccagctgtg ggggcgctgt tgatgcgcag ccaggcctcg 55920 

ccgccagagc ccgcacgctt ccattccgct gacttcatcg acgccctcag gatcgctggg 55980 

ccggccctgt gggagagtga atgtggcttt tgccaaagtt gagtctggag cctggaaact 56040 
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tccctatggg cagccttgat agtggagtgg cccaaggagc ccacccagcc gaccctgccc 56100 

ctcccgtggc tggtgggcgg caccaggggc tgcctggctt tgctcgttca ccaacatcac 56160 

ctgggctggc cagggcgcgc tcacttctgc caccaccgag ggccctgggc gaaggagtga 56220 

ataccaggct gccttggcag ggatgtgttg agggctgtgg ggagtcggac agcggcgggg 56280 

gtcagaggag gaggagggtg caccgtgcag gctgaagggc cacgttaccc tgaggttggc 56340 

caggctcccc aggcctagcc tcccagctcc cccactttct ccccaccctc caccagtggc 564 00 

aaagccagcc ccttcagggc gcacggtgtc tgcccccaag gagggcccat tccgttgggg 564 60 

ttaatgttgg ccacctcttt ctgtttgtct ctggcagaaa tcaccaagcc gccctcagac 56520 

gacagcccgg cccacagcag tgccatcggg cccgtcattg gcatcatcct ctctctcttc 56580 

gtcatgggtg gtgtctattt tgtgtgccag cgcgtggtgt gccagcgcta tgcgggggcc 56640 

aacgggccct tcccgcacga gtatgtcagc gggaccccgc acgtgcccct caatttcata ,56700 

gccccgggcg gttcccagca tggccccttc acaggtaagg agcctgagat atggaatgat 567 60 

ctggaggagg caggagagta gtctgggcag ctttggggag tggagcaggg atgtgctacc 56820 

ccaggccctc ttgcacatgt ggcagacatt gctaatcgat cacagcattc agcctttccc 56880 

actgagcctg tgcttggcat cagaatcctt caacacagag gcctgcatgg ctgtagcaac 56940 

ccaccctttg gcactgtagg tgtggagaaa gctccttgga cttgaccttc atattctagt 57000 

aggacatgtg ctgtgttgtc cacaaatcct catgtaccct agaaatgaat gtgggggcgg 57060 

ctgggctctc tccagagctg aaggaatcac tctgtaccat acagcagctt tgtcttgagt 57120 

gcagctggga tttgtggctg agcagttaca attcctacgt ggcccaggca ccaggaacgc 57180 

aggctgtgtt tgtagatggc tgggcagccg caccgcagag ctgcaccatg ctggtttgta 5724 0 

tcacatgggt gaccatggta tgtctaagaa ggtggagtcc ctgtgaggtc tgcaggtgcc 57300 

cccacagctc caggccacct tgaggattgc ctctgcctgc ccagccctga gttccctctc 57360 

ccctgtcctg tcccactgtc accccaagcc ggcctcattg ggagcctgtt ggatggcagg 574 20 

gtatagatgt aacctgattc tctctgggga gcggggttat ctggcttctc aagagctcct 574 80 

aggagcccac agtggtggca ccatcacagt cgcagcagcc cccagagaac gcggccctgt 57540 

ctgttcctgg cgtgctctgt gctgccccgc ctgggttccc tgccccagtc gcaggcccct 57600 

tggaggaggt accatgtgtc tcccgtttca cagatgagcc ccggggagct cactctagta 57660 

gtggccagag aggcctgcgg ctcagggagc ggggcacatt ' tccaacagga cacaccgccc 57720 

tggtctgagt ctcgtgggta gtgggagcag aggagagcgc cctatgtctg tggggcggct 57780 

tggctgagcc tggaagccac ctgacctccc ccgtcccttc cctgccaggc atcgcatgcg 578 40 

gaaagtccat gatgagctcc gtgagcctga tggggggccg gggcggggtg cccctctacg 57900 

accggaacca cgtcacaggg gcctcgtcca gcagctcgtc cagcacgaag gccacgctgt 57960 

acccgccggt gaggggcggg gccggggagg ggcggggcgg gatggggctg tgggcccctc 58020 

ccaccgtcag tgctggccac cggaggcttc ccgggttcct gggggctgtg ccaccgcctc 58080 

tgaggcatgc ttgctttctt cccttttcaa acccttctgc ttccttcttt aatgacattg 58140 

ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 58200 

ccacctccta gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttccgca 58260 

cggcccagcc catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 58320 

cagggttccc tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 58380 

agctggatgg ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 58440 

cctggagctt ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 58500 

actgtgagga gaataaactg ccgctgccct ccaggatcac tggggctggc tggggagaac 58560 

ccccgtttct gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 58620 

tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 58680 

gcagctgtat gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 58740 

tgagccccac cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 58800 

gtcttgtctt cttgttttct ttttcttttt tttttgagac ggagtttcac ttttgttgcc 58860- 

caggcttcag tgcaatggtg tgatctctag gctcaccgca acctctgcct cccaggttca 58920 

agcagttccc ctgcctcagc ctccctagta gctgggatta caggcatgtg ccaccacgcc 58980 

cagctaattt tgtattttta gtagagatgg ggtttctcca tgttggtcag gctgatcttg 59040 

aactccctac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 59100 

tgagccaccg cacctggcct ttttcttttc ttttcttttc ttttttctga gacagggtct 59160 

cgctctgtca cccaggctgg agtgcaatgg tgtcatcatg gctaactgca gcctctacct 59220 

tctaggctca agcaatcctc ccatctcagc ccctaagtag ctaggactgc acgcatgcat 59280 

ccccatgccc agctaatatt tacatttttt gtagagatga agtttcacta tattgcccag 59340 

gctggtctcc aactcctgga ctcgagcgat cctcctgcct cggcctcccc aggtgctggg 594 00 

attacaggcg tgagccaccg tgcctggcct ggggtattgt cttcttatgg cacctgactg 594 60 
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tggtgggccc 
tgagtgttct 
gaagccgaga 
ttaaccagcc 
acgaattctc 
gatggcactg 
cccacttaaa 
tcaccacagt 
ctgtttttaa 
tcaaaagata 
catccctggg 
tgtacatatg 
gatttatgtg 
agtgctgcct 
ttctttcacc 
tctgtcaccc 
ggctcaagcg 
tcatgccagg 
tagtcttgaa 
tacaggcatg 
catctgaaga 
ttctgagcat 
cccatctccg 
tgagcatgaa 
ctcacaacca 
acagccgtca 
ctggagggca 
ctgccaagtg 
aaaacggtcc 
gttaagatct 
taagaacctt 
gagatgagcg 
cccgtctcta 
ctgaagcatg 
cattgcactc 
aaaagaaaag 
atatatatat 
atcataaaaa 
tcctgttgcc 
cgggttggag 
accatgcctg 
ctggtctcga 
ttacagacgt 
gttttacgtt 
catccagctt 
ggaggcagaa 
gggactggaa 
taggccaggc 
gatcacctga 
ctaaaaatac 
ggaggctgag 
catgccactg 
aacaagaagt 
agggcacagt 
gaggacatgc 
ggtgttgagg 
ccatttcaac 



tgggaaggaa 
ggaggcccca 
gtttaaacac 
aggagtgcta 
ttttcctgtt 
agcagccagc 
aaagggtaca 
taattttaga 
attagcaaag 
cagaagagtc 
aggcatggcc 
caagcataca 
cagttgtgag 
gccatcttgc 
ttattcagat 
agtgggagtg 
attctcctgc 
ctaattttta 
ctcctgagct 
agccactgcg 
gtcaacattg 
ttcggaggct 
ggaggaggtg 
ccactggcct 
ccttctactc 
cattcttggt 
tcatggtcct 
gccctctgaa 
ctggcaccct 
tcatttatat 
tgggcccagc 
gatcacttga 
gtaaaaattt 
agaattgctt 
cagcctgggc 
aaattatcaa 
atatatattt 
acaacttatt 
caggctggag 
tgattctcct 
gctaattttg 
actcgcgacc 
gagccaccat 
cacatcttta 
gtttcttggc 
actctatgag 
tagcagaaat 
atggtggctc 
ggtcaggagt 
aaaaaaatta 
gcaggagaat 
tactccagcc 
tcaacttgaa 
ccatcccagg 
caggaagggc 
ctggacggga 
gtggaaactg 



gtagcagaag 
gggcctggct 
ccactgccct 
ggaacccaag 
ctcccttggg 
taatttttta 
gttcagtggg 
acattttcct 
ttattttgca 
tgcagtccaa 
tcctgcagtg 
agagcgtgga 
aaagagcaca 
atgacttcca 
tgacataagt 
cagtggcaca 
ctcagtcccc 
aattttttgt 
aaagtgatcc 
cctgggctga 
agccttgcct 
cccagctgcc 
ctattgtttt 
gggtcgttcg 
tgcttccccc 
ggctgaacag 
ctctcctgct 
agggacagtc 
ttctctccac 
ttattttatc 
gtgatggctt 
ggccgggagt 
aaaaattagc 
gaacatggga 
gacagagcga 
tctcctcttt 
tttttttttg 
ttcacactct 
tgcagtggcg 
gccttacctt 
tacttttagt 
tcaggtgatc 
gcccagccca 
attcttctgg 
tgatggctta 
aatttgaaca 
tgaacagtga 
acacctgtca 
tcgagaccag 
gctgggtgtg 
cacttgaacc 
tgggtggaag 
gggaaaaatg 
ctggcgctga 
ttggctggtg 
gaggccagct 
agctcttctg 



agggttcttc 
ttgtttaggg 
tcttccctgc 
cagggccccc 
agctgggagg 
aatcacttta 
ttttagtgta 
gcccctaaaa 
tcctttaaat 
agagactccg 
tgtttcttct 
ctttgttttc 
gacccattta 
ttcctatcat 
gttttttgtt 
atcacagctc 
tcaagtagct 
ggaggtgagg 
tcctgcctca 
catatgtgtt 
tttgctgcta 
gtgtgcaccc 
cctcattttg 
gttggtaggc 
gcaccctgaa 
cactccttgt 
gcggttgaac 
catcttttct 
ctgtctaata 
atgaatgatg 
gcagctgtaa 
ttgagaccag 
cgggtatggt 
ggcggaggtt 
gactctgtct 
tatggcatat 
gttatgttca 
ctctcttttt 
caatctcagc 
cccgagtagc 
agagacgggg 
cacccacctc 
cactctcttt 
gatgtaatta 
tgggtggcgt 
gagaaagttc 
gatgtacaga 
tcccagcact 
cctggccaac 
gtggcgcatg 
tgggaggcag 
agcggaactc 
ccgtattgtc 
tccacgggct 
ttcaggagcc 
aggggctcat 
cttctctttc 



ttggtttcct 
acaaagggaa 
tcctgctgct 
gagcacacag 
atcttaatca 
ttgtttaacc 
ttcacagatg 
gaaactctgc 
atatgttcat 
cccccatgac 
atgtcccccc 
caagccagaa 
tcctctgcct 
aagcaagaca 
tgttcttgag 
actgcagcct 
cagatggcag 
cctcactaaa 
gcctcccaaa 
ttcgtaagcc 
acgatgtata 
tgcctagagc 
caacaaggag 
agtggggcca 
gtttgttctg 
caggcgtggc 
cttggctgtt 
cagcagaggg 
tagagtaaaa 
taagcatcat 
tctcagcact 
cctggccaac 
gatcccagct 
gcagtgagcc 
caaaaaaaaa 
atatatatat 
gaaaggcctt 
tttttgagac 
tcactgtaac 
tgggattata 
gtttctccat 
ggcctcccaa 
cttaacgtcc 
gatttgatga 
gaattagtcg 
cgtctacagg 
gaactctaag 
ttgggagacc 
atagtgaaac 
cctgtaatcc 
aggttgcagt 
tgtctgaaaa 
tttccctttg 
ggagaggggc 
caggccaggt 
gtaggatatg 
ttcttcactg 



ggacagtaac 
ctggtaacca 
gcaacccagc 
caggcagctc 
ggcaataaga 
atatgactca 
tgtgcaaccc 
atgaagccag 
ggtacaaaat 
gccaagcagg 
aggggtcatc 
gataattgta 
ggtttccccc 
ctgataacga 
acaaacttcc 
caaactcctg 
gtgtgcacca 
tttcctgggc 
gtggtaggat 
cgaaagatag 
aaagctgctg 
tctaccgtaa 
gctgaagaac 
ggccatccaa 
ttttgaggac 
tgggccccca 
tcaaccactc 
ccacactggc 
atggtatcat 
tttgtgtgtt 
ttaggaggct 
atggagaaac 
acttgggagt 
gagatcgcgc 
aaaaaaaaag 
atatatatat 
ccctgctctg 
agagttttgc 
ctccgcctcc 
ggcatgcacc 
gttggtcagg 
agtgctggga 
tcctcctttc 
gcaaggtggg 
gggtctatca 
cttattacca 
aatgcaggaa 
aaggcgggtg 
cccatctcta 
cagctactcg 
gagccgagat 
aaaaaaaaaa 
ttatgtcacc 
tgccccagaa 
caggtcaaga 
aggggtcggc 
cattaagatt 



59520 

59580 

59640 

59700 

59760 

59820 

59880 

59940 

60000 

60060 

60120 

60180 

60240 

60300 

60360 

60420 

60480 

60540 

60600 

60660 

60720 

60780 

60840 

60900 

60960 

61020 

61080 

61140 

61200 

61260 

61320 

61380 

61440 

61500 

61560 

61620 

61680 

61740 

61800 

61860 

61920 

61980 

62040 

62100 

62160 

62220 

62280- 

62340 

62400 

62460 

62520 

62580 

62640 

62700 

62760 

62820 

62880 
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caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 62940 

ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 63000 

aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060 

cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63120 

gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63180 

gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240. 

gtggcctccc cggcagtgga aatagccaaa ctggcaacat ggactttctt caactgcccg 63300 

ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63360 

ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420 

gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 634 80 

agtggctgtc actgcagagg tgagagggtt tggagagaag gtacatgcct ttttggccac 6354 0 

attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63600 

atgcaaaaaa aaatgttggc tttgcttctt tttacccaaa cctcagttcc ctcaagtaga 63660 

ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63720 

tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780 

tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63840 

ggctttggaa ccccgctgtc tcctggattg ggagcaaatg. caggggagga gccacacctg 63900 

actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63960 

gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 64020 

tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 64 080 

tcaccacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 64140 

gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 64200 

tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 64260 

atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 64320 

tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 64380 

actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 64440 

ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 64 500 

gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 64560 

gcccagccta ggggtacatg aaactttttt tttttttttt ttgagacaga gtttcactct 64 620 

gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680 

caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgc.caccaca 64740 

cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 64800 

tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64 860 

ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 64 920 

tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64 980 

ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040 

gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 65100 

tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160 

gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220 

accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 65280 

caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340 

aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400 

ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460 

actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520 

tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 65580 

ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640 

ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 65700- 

tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760 

agtgagccaa gattgcgcca ctgcactcca gtctgggtga cagagtaaga ttccatcccg 65820 

aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 65880 

cctcgcttgt ttcacttaac ataatgacct ccagttccat ccacgttgtt gctttgttat 65940 

aaatgacagg atcttggtca ggcgcagtgg ctcatgcctg taatcccagc actttgggag 66000 

gctgaggtgg actgatcatg aggtcaagag atcgagacca tcctggctaa cacagtgaaa 66060 

ccccgtctct actaaaaata caagaaatta gccgggcgtg gtggtgggca cccatttccg 66120 

ccccttctcg ggacgctgat gcacgacata ttacccatcc ccggaagact aatcctcccc 66180 

cactctatat tgtacctctt cctttctcct ccacgcgatt ccccgagtaa cccgtcttcc 66240 

ctccctcctc ggattacgct cacctttccg cttcaatcac gttgctccgt ccccttcccc 66300 
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attcgtacca ctcctcactt tcgtcttcct acccccacta tcccttttcg tcctctctat 66360 

tccttactta ctcctccccc ttctcttcat acttcattcc ctccgctctt cccactcgcg 66420 

ctcccacttt cacctagttg ccctcaccta cgttgccatc tcgccccttc ttcagctctc 66480 

ggcctctcac ccatctgtcc tctctcttac ctctctcctc atctcgctca gacatctctc 66540 

tagactatcc ctcactttac cttctcagtc gtcttcttcc tatccttcgt tctccatgat 66600 

cttcacgtcg ccatctcttt tcgccccttt catatgtctc tcttcatgtt ctcactatca 66660 

ttctcatgat cactatcgtt ctcactactt atcactcccc tctttcttca tcaattcctc 66720 

tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780 

ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840 

cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900 

tcctcttcta ttgcacatcc tcttctattg cac 66933 



<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 12 

ctgagcggaa ttcgtgagac c 21 

<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 13 

ttggtctcac gtattccgct cga 23 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

* 

<220> 

<223> Primer 
<400> 14 

ctcgagaatt ctggatcctc 20 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 15 

ttgaggatcc agaattctcg ag 



22 



<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 16 

tgtatgcgaa ttcgctgcgc g 21 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<4O0> 17 

ttcgcgcagc gaattcgcat aca 



23 



<4O0> 19 

ttgtcactga gaattcagtg gac 



23 



<210> 20 
<211> 21 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Primer 



<400> 20 

gaatccgaat tcctggtcag c 



21 



<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 21 

ttgctgacca ggaattcgga ttc 23 

<210> 22 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 23 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 23 

cuacuacuac uactcgagaa ttctggatcc tc 32 

<210> 24 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 22 

cuacuacuac uactgagcgg aattcgtgag acc 



33 



<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 



33 



<210> 25 
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<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 25 

cuacuacuac uagtccactg aattctcagt gag 33 



<210> 26 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 26 

cuacuacuac uagaatccga attcctggtc age 33 



<210> 27 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400>- 27 

aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 45 



<210> 28 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 28 

aatteggcac gag 13 



<210> 29 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 29 

ctcgtgccg 9 
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<210> 30 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 30 

gtacgacggc cagt 14 



<210> 31 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 31 

aacagctatg accatg 16 



<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 32 

* 

ccaagttctg agaagtcc 18 



<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 33 

aatacctgaa accatacctg 20 



<210> 34 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 



57 



<210> 35 



<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 35 

tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56 

<210> 36 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 37 

gccgtaatgc ggcacaggga ataagct 27 

<210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 36 

agctgctcgt agctgtctct ccctgga 



27 



<400> 38 

gagaggctat atccctgggc 



20 



<210> 39 
<211> 20 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 39 

acagcacgtg tttaaagggg 20 



<210> 40 
<211> 163 
<212> DNA 

<213> Homo sapiens 



<400> 40 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60 

gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120 

gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163 



<210> 41 

<211> 419 

<212> DNA 

<213> Homo sapiens 



<400> 41 

gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60 

ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 120 

tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180 

agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 24 0 

tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 300 

agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 360 

aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419 



<210> 42 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 42 

ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60 

gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 120 

ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180 

caccgtgcca acctggacgg ctcgttccgg taggtaccca c 221 



<210> 43 

<211> 221 

<?12> DNA 

<213> Homo sapiens 

<400> 43 

tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 60 

tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 120 

agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 180 

aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221 
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<210> 44 
<211> 156 
<212> DNA 

<213> Homo sapiens 
<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 60 
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 120 
cggcaggacg tgtaaggcag gtgaggcggt gggacg 156 



<210> 45 
<211> 416 
<212> DNA 

<213> Homo sapiens 
<400> 45 

ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 60 
gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 120 
tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 180 
catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 240 
cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 300 
cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 360 
ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 416 



<210> 46 

<211> 198 

<212> DNA 

<213> Homo sapiens 

<400> 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 60 

aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 120 

ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 180 

gaggtgaggc tcctgtgg 198 



<210> 47 

<211> 244 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60 

tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120 

gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180 

tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240 

ggtc 244 



<210> 48 
<211> 313 
<212> DNA 

<213> Homo sapiens 



Page 99 



WO 02/092015 PCT/US02/15982 

032796-132, ST25 

<400> 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60 

tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120 

tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180 

ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240 

cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 300 

aggtagcgtg ggc 313 



<210> 49 

<211> 255 

<212> DNA 

<213> Homo sapiens 



<400> 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60 

agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 

gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180 

tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 24 0 

aagtgtttgc ctgtc 255 



<210> 50 
<211> 210 
<212> DNA 

<213> Homo sapiens 



<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 

ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120 

accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180 

tcgtccaaca tgctgggtga gggccgggct 210 



<210> 51 
<211> 352 
<212> DNA 

<213> Homo sapiens 



<400> 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 

tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120 

ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 

acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240 

ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 300 

actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352 



<210> 52 
<211> 225 
<212> DNA 

<213> Homo sapiens 



<400> 52 

gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60 

atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120 

aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180 
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atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225 



<210> 53 

<211> 235 

<212> DNA 

<213> Homo sapiens 



<400> 53 

ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 60 

gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 120 

caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180 

ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235 



<210> 54 

<211> 218 

<212> DNA 

<213> Homo sapiens 



<400> 54 

ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 60 

cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 120 

gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 180 

cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218 



<210> 55 

<211> 234 

<212> DNA 

<213> Homo sapiens 



<400> 55 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120 

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180 

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234 



<210> 56 

<211> 157 

<212> DNA 

<213> Homo sapiens 



<400> 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60 

ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 120 

acctgctgac ctgtggaggt aggtgtgacc taggtgc 157 



<210> 57 

<211> 272 

<212> DNA 

<213> Homo sapiens 

<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 



Page 101 



WO 02/092015 PCTYUS02/15982 

032796-132. ST25 



cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120 

tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg 180 

gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 24 0 

agacgaggtg gactgtgacg gtgaggccct cc 272 



<210> 58 

<211> 134 

<212> DNA 

<213> Homo sapiens 



<400> 58 

tctccttgca gccatctgcc tgcccaacca 
catcaaacag cagtgcgact ccttccccga 
tggtgagcca gctt 



gttccggtgt gcgagcggcc agtgtgtcct 60 
ctgtatcgac ggctccgacg agctcatgtg 120 

134 



<210> 59 

<211> 274 

<212> DNA 

<213> Homo sapiens 

<400> 59 

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60 

ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120 

tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180 

atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240 

gccccttcac aggtaaggag cctgagatat ggaa 274 



<210> 60 

<211> 164 

<212> DNA 

<213> Homo sapiens 



<400> 60 

cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 

gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 120 

cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164 



<210> 61 

<211> 130 

<212> DNA 

<213> Homo sapiens 



<400> 61 

ttggctctcc tcagatcctg aacccgccgc 
acatggacat gttctactct tcaaacattc 
tcccctgcag 



cctccccggc cacggacccc tccctgtaca 60 
cggccactgc gagaccgtac aggtaggaca 120 

130 



<210> 62 

<211> 496 

<212> DNA 

<213> Homo sapiens 
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<400> 62 

tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc 60 

ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120 

gccagcaagt actacctgga tttgaactcg gactcagacc cctatccacc cccacccacg 180 

ccccacagcc agtacctgtc ggcggaggac agctgcccgc cctcgcccgc caccgagagg 240 

agctacttcc atctcttccc gccccctccg tccccctgca cggactcatc ctgacctcgg 300 

ccgggccact ctggcttctc tgtgcccctg taaatagttt taaatatgaa caaagaaaaa 360 

aatatatttt atgatttaaa aaataaatat aattgggatt ttaaaaacat gagaaatgtg 420 

aactgtgatg gggtgggcag ggctgggaga actttgtaca gtggagaaat atttataaac 4 80 

ttaattttgt aaaaca 496 



<210> 63 

<211> 3081 

<212> DNA 

<213> Homo sapiens 



<400> 63 

cccgccagcc 

tcagagagaa 

tatgattctc 

ctcctggacc 

cacctccgga 

aagctcatgc 

aagatgagag 

ggcgtcaaac 

ctgggcatga 

acttcagcca 

gtcaacatcc 

caccgacacc 

aatctgaata 

gccgaagaca 

agcttctacc 

aaggtgttgg 

agtgatctgt 

aacaccatgc 

aagccgccca 

aagctgcggc 

atcaacaatg 

aatgagatcc 

tccatccacg 

accgccaccc 

ctggctgccc 

ctggactatt 

gacaatctgg 

ctggagacca 

tggatggagg 

atccagggac 

gagcgcctgg 

gtcaatatgg 

tgggaccacg 

cgacagcagc 

ccctggatcc 

ctggaggacc 

aagattgatc 

aagcacacca 

atcgccagga 



cagcccagcc 
ggtggaggaa 
agcaaaccaa 
cggcctggga 
aggcggggac 
tgctgctgga 
tgcacaagat 
tggtgtccat 
tctggaccat 
aggaagggct 
agaacttcca 
ggcccgagct 
cggcttttga 
tcgttggaac 
acgccttctc 
ccgtcaacca 
tggagtggat 
atgccatgca 
aggtgcagga 
tcagcaaccg 
cctggggctg 
ggaggctgga 
aggcctggac 
tctcggagat 
accaggaccg 
atgactcacc 
gggccctaac 
ttgaccagct 
gggccatgga 
tgaccacagc 
ccatcctggg 
cgggcaccaa 
tgcggcagct 
acaatgagag 
agaccaagat 
agctcagcca 
agctggaggg 
actacaccat 
ccatcaatga 



caaccctact 
gaaatccaga 
cgattacatg 
gaagcagcag 
acagatcgag 
ggtcatctca 
ctccaacgtc 
cggagccgaa 
catcctgcgc 
gctcctgtgg 
cataagctgg 
gattgactac 
cgtggcagag 
tgcccgaccg 
tggagcccag 
ggagaacgag 
ccgccgcaca 
acagaagctg 
gaagtgccag 
gcctgccttc 
cctggagcag 
gcgactggac 
tgacggcaaa 
caaggccttg 
tgtggagcag 
cagtgtcaac 
tcagaagcga 
gtacttggag 
ggacctgcag 
ccatgagcag 
catccacaat 
cccctacaca 
ggtgcctcgg 
gctacgcaag 
ggaggagatc 
cctgcggcag 
cgaccaccag 
ggagcacatc 
ggtagagaac 



ccctccccac 
ccctagcacg 
cagccagaag 
agaaagacat 
aacatcgaag 
ggtgaacgct 
aacaaggccc 
gaaatcgtgg 
tttgccatcc 
tgtcagagaa 
aaggatggcc 
gggaagctgc 
aagtacctgg 
gatgagaaag 
aaggcggaga 
cagcttatgg 
atcccgtggc 
gaggacttcc 
ctggagatca 
atgccctctg 
gtggagaagg 
cacctggcag 
gaggccatgc 
ctcaagaagc 
attgccgcca 
gcccgttgcc 
agggaagctc 
tatgccaagc 
gacaccttca 
ttcaaggcca 
gaggtgtcca 
accatcacgc 
agggaccaag 
cagtttggag 
gggaggatct 
tatgagaaga 
ctcatccagg 
cgtgtgggct 
cagatcctga 



gccagggcag 
cgcgcaccat 
aggactggga 
tcacggcatg 
aggacttccg 
tggccaagcc 
tggatttcat 
atgggaatgt 
aggacatctc 
agacagcccc 
tcggcttctg 
ggaaggatga 
acatccccaa 
ccatcatgac 
cagcagccaa 
aagactacga 
tggagaaccg 
gggactaccg 
acttcaacac 
agggcaggat 
gctatgagga 
agaagttccg 
tgcgacagaa 
atgaggcctt 
tcgcacagga 
aaaagatctg 
tggagcggac 
gggctgcacc 
ttgtgcacac 
ccctccctga 
agattgtcca 
ctcaggagat 
ctctgacgga 
cccaggccaa 
ccattgagat 
gcatcgtcaa 
aggcgctcat 
gggagcagct 
cccgggatgc 



cagccgttgc 
catggaccat 
ccgggacctg 
gtgtaactcc 
ggatggcctg 
agagcgaggc 
agccagcaaa 
gaagatgacc 
cgtggaagag 
ttacaaaaat 
tgctttgatc 
tccactcaca 
gatgctggat 
ttacgtgtct 
tcgcatctgc 
gaagctggcc 
ggtgcccgag 
gcgcctgcac 
gctgcagacc 
ggtctcggac 
gtggttgctg 
gcagaaggcc 
ggactatgag 
cgagagtgac 
gctcaatgag 
tgaccagtgg 
cgagaaactg 
cttcaacaac 
cattgaggag 
tgccgacaag 
gacctaccac 
caatggcaaa 
ggagcatgcc 
tgtcatcggg 
gcatgggacc 
ctacaagcca 
cttcgacaac 
gctcaccacc 
caagggcatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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agccaggagc agatgaatga gttccgggcc tccttcaacc actttgaccg ggatcactcc 2400 

ggcacactgg gtcccgagga gttcaaagcc tgcctcatca gcttgggtta tgatattggc 24 60 

aacgaccccc agggagaagc agaatttgcc cgcatcatga gcattgtgga ccccaaccgc 2520 

ctgggggtag tgacattcca ggccttcatt gacttcatgt cccgcgagac agccgacaca 2580 

gatacagcag accaagtcat ggcttccttc aagatcctgg ctggggacaa gaactacatt 2640 

accatggacg agctgcgccg cgagctgcca cccgaccagg ctgagtactg catcgcgcgg 2700 

atggccccct acaccggccc cgactccgtg ccaggtgctc tggactacat gtccttctcc 27 60 

acggcgctgt acggcgagag tgacctctaa tccaccccgc ccggccgccc tcgtcttgtg 2820 

cgccgtgccc acagatgtga aatgaatgta atctaataga agcctaatca gcccaccatg 2880 

ttctccactg aaaaatcctc tttctttggg gtttttcttt ctttcttttt tgattttgca 2940 

ctggacggtg acgtcagcct gtacaggctc ccaggggtgg cgtcaaatgc tattgaaatt 3000 

gcgctgaatc gtatgctttt tccttttgat aaataaacaa tgtaaaaatg tttcaaaaac 3060 

ctaataaaat aaataaatac g 3081 



<210> 64 

<211> 1324 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1) . . . (1324) 
<223> n = A,T,C or G 



<400> 64 

ggccgcccgg 

ccccgccgcg 

ccagcaactc 

actgcaagct 

agagatgcag 

gcacaaacag 

cctctcccaa 

cgctcccgag 

gcaggccctg 

ggcggtcagc 

ctccaaggaa 

gtcggattag 

gcacggagag 

ccatagtatt 

ttcttcctac 

tgccctgaca 

ggagcaagac 

cgacagtctg 

tgcccctttc 

ccacctccac 

gtcctctccc 

gagcagggtg 

tact 



cgcccccagc 
attgacatga 
aaattcacca 
cagtaccaca 
cgtcactatg 
gctgagatcg 
gagcaccagc 
ctgaactcta 
gccctgccct 
gcaggcaccg 
gacaagaacg 
cagggggccg 
aaaggaatgt 
tatggtggcg 
cccattccgg 
tgtgaggcaa 
ctgcgaagcg 
ccctgggcca 
ccgtcctggg 
ctgccccact 
atcgcccaca 
agggtgggcg 



agnccgagcc 
tgtttccaca 
cctcggactc 
gcctcaagct 
tgatgtacta 
tcaaaaggct 
agcaggtctt 
tcatccgaca 
tgaccccact 
gcctcctctc 
ggcacgatgg 
ggacagggag 
ttagcacaag 
ccggcggggc 
cttccctcct 
gctaaggcct 
cagcagcccc 
cagccctctc 
caccccgagt 
caccacctct 
aaaggggggg 
acccaggatt 



ggggcgcaca 
aagcaggcat 
ctgcgaccgc 
cgaatgtgac 
cgagatgtcc 
gaacgggatt 
gggagccatt 
gcagctccaa 
acccgtgggg 
gctgtccgcg 
tgacacccac 
gttgggaggg 
acacagcgga 
cccagcccag 
cctcccctgc 
ggagggtcag 
ggcccttccc 
accctggtac 
ctcccccgac 
gctagttcca 
cacgagggac 
ccccctcccc 



gncggggcgc 
tcgggctcct 
atcaaagacg 
aagttggcca 
tacggcttga 
tgtgcccagg 
gagagggcca 
gcccaccagc 
ctgcagccgc 
ctgggttccc 
caggaggatg 
gggacagagg 
gctcgggatt 
cttgcaggcc 
agcctggtta 
atgggagacc 
ccgttttgaa 
tgcatgcacg 
cccgggtccc 
gacacctcca 
gagcttagct 
ttcccaaata 



agcccgcgcc 
cgcacctacc 
aatttcagct 
gtgagaagtc 
acatcgagat 
tcctgcccta 
agcaggtcac 
tgtcccagct 
cttcgctgcc 
aggcccacct 
atggcgagaa 
ggagacagag. 
ggctaatctc 
acctctagct 
ggtggatacc 
aggtcccaag 
catgtgtaac 
caatgctagc 
aggtatgctc 
cgcccacctg 
gagctgggag 
aagatgaggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660- 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1324 



<210> 65 
<211> 2377 
<212> DNA 

<213> Homo sapiens 
<400> 65 

ggtgacaaag agccaacaga gacaatagga gacttgtcaa tttgtcttga tgggctacag 60 
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ttagagtctg aagttgttac caatggtgaa actacatgtt cagaaagtgc ttctcagaat 120 

gatgatggct ccagatccaa ggatgaaaca agagtgagca caaatggatc agatgaccct 180 

gaagatgcag gagctggtga aaataggaga gtcagtggga ataattctcc atcactctca 240 

aatggtggtt ttaaaccttc tagacctcca agaccttcac gaccaccacc acccacccca 300 

cgtagaccag catctgtcaa tggttcacca tctgccactt ctgaaagtga tgggtctagt 360 

acaggctctc tgccgccgac aaatacaaat acaaatacat ctgaaggagc aacatctgga 420 

ttaataattc ctcttactat atctggaggc tcaggcccta ggccattaaa tcctgtaact 480 

caagctccct tgccacctgg ttgggagcag agagtggacc agcacgggcg agtttactat 54 0 

gtagatcatg ttgagaaaag aacaacatgg gatagaccag aacctctacc tcctggctgg 600 

gaacggcggg ttgacaacat gggacgtatt tattatgttg accatttcac aagaacaaca 660 

acgtggcaga ggccaacact ggaatccgtc cggaactatg aacaatggca gctacagcgt 720 

agtcagcttc aaggagcaat gcagcagttt aaccagagat tcatttatgg gaatcaagat 780 

ttatttgcta catcacaaag taaagaattt gatcctcttg gtccattgcc acctggatgg 840 

gagaagagaa cagacagcaa tggcagagta tatttcgtca accacaacac acgaattaca 900 

caatgggaag accccagaag tcaaggtcaa ttaaatgaaa agcccttacc tgaaggttgg 960 

gaaatgagat tcacagtgga tggaattcca tattttgtgg accacaatag aagaactacc 1020 

acctatatag atccccgcac aggaaaatct gccctagaca atggacctca gatagcctat 1080 

gttcgggact tcaaagcaaa ggttcagtat ttccggttct ggtgtcagca actggccatg 1140 

ccacagcaca taaagattac agtgacaaga aaaacattgt ttgaggattc ctttcaacag 1200 

ataatgagct tcagtcccca agatctgcga agacgtttgt gggtgatttt tccaggagaa 1260 

gaaggtttag attatggagg tgtagcaaga gaatggttct ttcttttgtc acatgaagtg 1320 

ttgaacccaa tgtattgcct gtttgaatat gcagggaagg ataactactg cttgcagata 1380 

aaccccgctt cttacatcaa tccagatcac ctgaaatatt ttcgttttat tggcagattt 1440 

attgccatgg ctctgttcca tgjggaaattc atagacacgg gtttttcttt accattctat 1500 

aagcgtatct tgaacaaacc agttggactc aaggatttag aatctattga tccagaattt 1560 

tacaattctc tcatctgggt taaggaaaac aatattgagg aatgtgattt ggaaatgtac 1620 

ttctccgttg acaaagaaat tctaggtgaa attaagagtc atgatctgaa acctaatggt 1680 

ggcaatattc ttgtaacaga agaaaataaa gaggaataca tcagaatggt agctgagtgg 1740 

aggttgtctc gaggtgttga agaacagaca caagctttct ttgaaggctt taatgaaatt 1800 

cttccccagc aatatttgca atactttgat gcaaaggaat tagaggtcct tttatgtgga 18 60 

atgcaagaga ttgatttgaa tgactggcaa agacatgcca tctaccgtca ttatgcaagg 1920 

accagcaaac aaatcatgtg gttttggcag tttgttaaag aaattgataa tgagaagaga 1980 

atgagacttc tgcagtttgt tactggaacc tgccgattgc cagtaggagg atttgctgat 2040 

ctcatgggga gcaatggacc acagaaattc tgcattgaaa aagttgggaa agaaaattgg 2100 

ctacccagaa gtcatacctg ttttaatcgc ctggacctgc caccatacaa gagctatgag 2160 

caactgaagg aaaagctgtt gtttgccata gaagaaacag aaggatttgg acaagagtaa 2220 

cttctgagaa cttgcaccat gaatgggcaa gaacttattt gcaatgtttg tccttctctg 2280 

cctgttgcac atcttgtaaa attggacaat ggctctttag agagttatct gagtgtaagt 2340 

aaattaatgt tctcatttaa aaaaaaaaaa aaaaaaa 2377 



<210> 66 

<211> 1295 

<212> DNA 

<213> Homo sapiens 



<400> 66 

gggccgccgc ccacccgggc cttgcctcta cctcagtcgt tgccccccga ttttcggctg 60 

gagcccacgg ccccggccct cagcccccgc tctagcttcg ccagtagctc ggccagcgac 120 

gcgagcaagc cgtccagccc ccggggcagc ctgctgctgg acggggcggg ggctggcgga 180 

gctggaggta gccggccctg cagcaatcgc accagcggca tcagcatggg ctacgaccag 24 0' 

cgccacggga gccccttgcc agcggggccg tgcctgtttg gcccacccct ggccggagca 300 

ccggcaggct attctcccgg aggggtcccg tccgcctacc cggagctcca cgccgccctg 360 

gaccgattgt acgctcagcg gcccgcgggg ttcggctgcc aggaaagccg ccactcgtat 420 

cccccggccc tgggcagccc tggagctcta gccggggccc gagtgggagc ggcggggccc 480 

ttggagagac ggggggcgca acccggacga cactctgtga ccggctacgg ggactgcgcc 540 

gtgggcgccc ggtaccagga cgagctaaca gctttgcttc gcctgacggt gggcaccggt 600 

gggcgagaag ccggagcccg cggagaaccc tcggggattg agccgtcggg tctggaggag 660 
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ccaccaggtc ctttcgttcc ggaggccgcc cgggcccgga tgcgggagcc agaggccagg 720 

gaggactact tcggcacctg tatcaagtgc aacaaaggca tctatgggca gagcaatgcc 780 

tgccaggccc tggacagcct ctaccacacc cagtgctttg tttgctgctc ttgtgggcga 840 

actttgcgtt gcaaggcttt ctacagtgtc aatggctctg tgtactgtga ggaagattat 900 

ctgttttcag ggtttcagga ggcagctgag aaatgctgtg tctgtggtca cttgattttg 960 

gagaagatcc tacaagcaat ggggaagtcc tatcatccag gctgtttccg atgcattgtt 1020 

tgcaacaagt gcctggatgg catccccttc acagtggact tctccaacca agtatactgt 1080 

gtcaccgact accacaaaaa ttatgctcct aagtgtgcag cctgtggcca acccatcctc 1140 

ccctctgagg gctgtgagga catcgtgagg gtgatatcca tggaccggga ttatcacttt 1200 

gagtgctacc actgtgagga ctgccggatg cagctgagtg atgaggaagg ctgctgctgt 1260 

ttccctctgg atgggcactt gctctgccat ggttg 1295 

<210> 67 
<211> 3411 
<212> DNA 

<213> Homo sapiens 
<400> 67 

gggcccgggg tcccgccacc accgcgcgcg ggacagattg attcactttg gagctgtaag 60 

tactgatgta ttagggtgca gcgctcattg ttcattgacg cagagtccca aaatgaatat 120 

ccaagagcag ggtttcccct tggacctcgg agcaagtttc accgaagatg ctccccgacc 180 

cccagtgcct ggtgaggagg gagaactggt gtccacagac ccgaggcccg ccagctacag 240 

tttctgctcc gggaaaggtg ttggcattaa aggtgagact tcgacggcca ctccgaggcg 300 

ctcggatctg gacctggggt atgagcctga gggcagtgcc tcccccaccc caccatactt 360 

gaagtgggct gagtcactgc attccctgct ggatgaccaa gatgggataa gcctgttcag 420 

gactttcctg aagcaggagg gctgtgccga cttgctggac ttctggtttg cctgcactgg 480 

cttcaggaag ctggagccct gtgactcgaa cgaggagaag aggctgaagc tggcgagagc 540 • 

catctaccga aagtacattc ttgataacaa tggcatcgtg tcccggcaga ccaagccagc 600 

caccaagagc ttcataaagg gctgcatcat gaagcagctg atcgatcctg ccatgtttga 660 

ccaggcccag accgaaatcc aggccactat ggaggaaaac acctatccct ccttccttaa 720 

gtctgatatt tatttggaat atacgaggac aggctcggag agccccaaag tctgtagtga 780 

ccagagctct gggtcaggga cagggaaggg catatctgga tacctgccga ccttaaatga 840 

agatgaggaa tggaagtgtg accaggacat ggacgaggac gatggcagag acgctgctcc 900 

ccccggaaga ctccctcaga agctgctcct ggagacagct gccccgaggg tctcctccag 960 

tagacggtac agcgaaggca gagagttcag gtatggatcc tggcgggagc cagtcaaccc 1020 

ctattatgtc aatgccggct atgccctggc cccagccacc agtgccaacg acagcgagca 1080 

gcagagcctg tccagcgatg cagacaccct gtccctcacg gacagcagcg tggatgggat 1140 

ccccccatac aggatccgta agcagcaccg cagggagatg caggagagcg cgcaggtcaa 1200 

tgggcgggtg cccctacctc acattccccg cacgtaccgg gtgccgaagg aggtccgcgt 1260 

ggagcctcag aagttcgcgg aggagctcat ccaccgcctg gaggctgtgc agcgcacgcg 1320 

ggaggccgag gagaagctgg aggagcggct gaagcgcgtg cgcatggagg aggaaggtga 1380 

ggacggcgat ccatcgtcag ggcccccagg gccgtgtcac aagctgcctc ccgcccccgc 14 40 

ttggcaccac ttcccgcccc gcttgtgttg gacatgggct tgtgccgggc tccgggatgc 1500 

acacgaggag aaccctgaga gcatcctgga cgagcacgta cagcgtgtgc tgaggacaac 1560 

tggccgccag tcgcctgggc ctggccatcg ctccccggac agtgggcacg tggccaagat 1620 

gccagtggca ctggggggtg ccgcctcggg gcacgggaag cacgtaccca agtcaggggc 1680 

gaagctggac gcggccggcc tgcaccacca ccgacacgtc caccaccacg tccaccacag 1740 

cacagcccgg cccaaggagc aggtggaggc cgaggccacc cgcagggccc agagcagctt 1800 

cgcctggggc ctggaaccac acagccatgg ggcaaggtcc cgaggctact cagagagtgt 1860 

tggcgctgcc cccaacgcca gtgatggcct cgcccacagt gggaaggtgg gcgttgcgtg 1920 

• caaaagaaat gccaagaagg ctgagtcggg gaagagcgcc agcaccgagg tgccaggtgc 1980 

ctcggaggat gcggagaaga accagaaaat catgcagtgg atcattgagg gggaaaagga 2040 

gatcagcagg caccgcagga ccggccacgg gtcttcgggg acgaggaagc cacagcccca 2100 

tgagaactcc agaccyttgt cccttgagca cccctgggcc ggccctcagc tccggacctc 2160 

cgtgcagccc tcccacctct tcatccaaga ccccaccatg ccaccccacc cagctcccaa 2220 

ccccctaacc cagctggagg aggcgcgccg acgtctggag gaggaagaaa agagagccag 2280 

ccgagcaccc tccaagcaga ggtatgtgca ggaggttatg cggcggggac gcgcctgcgt 2340 
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caggccagcg tgcgcgccgg tgctgcacgt ggtaccagcc gtgtcggaca tggagctctc 24 00 

cgagacagag acaagatcgc agaggaaggt gggcggcggg agtgcccagc cgtgtgacag 24 60 

catcgttgtg gcgtactact tctgcgggga acccatcccc taccgcaccc tggtgagggg 2520 

ccgcgctgtc accctgggcc agttcaagga gctgctgacc aaaaagggca gctacagata 2580 

ctacttcaag aaagtgagcg acgagtttga ctgtggggtg gtgtttgagg aggttcgaga 2640 

ggacgaggcc gtcctgcccg tctttgagga gaagatcatc ggcaaagtgg agaaggtgga 2700 

ctgataggct ggtgggctgg ccgctgtgcc aggcgaggcc cttggcgggc acgggtgtca 2760 

cggccaggca gatgacctcg tactcaggag cccgatgggg aacagtgttg ggtgtaccac 2820 

ccatccctgt ggtctacccg tgtctagagg caggtagggg gtccctccaa gtggtccaca 2880 

agcttctgtc ctgcccccaa ggaggcagcc tggaccactc ctcatagcaa tacttggagg 2940 

gcccagccca agtgaggcag ccgaggtccc tgctgccagc ttcaggtgac ccccccccat 3000 

cccccggcac ctcccttggg cacgtgtgct gggatctact ttccctctgg gatttgccca 3060 

cgtacccagg tctggctggg gcccaggccc ggatgcagag gcctgcaggg cctctgtcaa 3120 

ttgtacgcgc caccaagtgc cttcaacaca gcttgtctct tgcctgccac tgtgtgaatc 3180 

ggcgacggag cactgcacct gcctccagcc gccggctgtg cagtcctggg tcctcctttc 3240 

tgagggcccg tgtaaatatg tacatttctc aggctagggc cagcaggggc tgcccgagtc 3300 

tgtttttcat gcgatgacac. ttgtacaatt atcttttcaa aggtacttgg ataataatga 3360 

aataaaactg tttttgaacc tgaataaaaa aaaaaaaaaa aaaaaaaaaa a 3411 

<210> 68 

<211> 3140 

<212> DNA 

<213> Homo sapiens 

• <400> 68 

ggctgcgagt acctccatgg tcccggtggc tgtgacggcg gcagtggcgc ctgtcctgtc 60 

cataaacagc gatttctcag atttgcggga aattaaaaag caactgctgc ttattgcggg 120 

ccttacccgg gagcggggcc tactacacag tagcaaatgg tcggcggagt tggctttctc 180 

tctccctgca ttgcctctgg ccgagctgca accgcctccg cctattacag aggaagatgc 240 

ccaggatatg gatgcctata ccctggccaa ggcctacttt gacgttaaag agtatgatcg 300 

ggcagcacat ttcctgcatg gctgcaatag caagaaagcc tattttctgt atatgtattc 360 

cagatatctg tctggagaaa aaaagaagga cgatgaaaca gttgatagct taggccccct 420 

ggaaaaagga caagtgaaaa atgaggcgct tagagaattg agagtggagc tcagcaaaaa 4 80 

acaccaagct cgagaacttg atggatttgg actttatctg tatggtgtgg tgcttcgaaa 540 

actggacttg gttaaagagg ccattgatgt gtttgtggaa gctactcatg ttttgccctt 600 

gcattgggga gcctggttag aactctgtaa cctgatcaca gacaaagaga tgctgaagtt 660 

cctgtctttg ccagacacct ggatgaaaga gttttttctg gctcatatat acacagagtt 720 

gcagttgata gaggaggccc tgcaaaagta tcagaatctc attgatgtgg gcttctctaa 780 

gagctcgtat attgtttccc aaattgcagt tgcctatcac aatatcagag atattgacaa 840 

agccctctcc atttttaatg agctaaggaa acaagaccct tacaggattg aaaatatgga 900 

cacattctcc aaccttcttt atgtcaggag catgaaatcg gagttgagtt atctggctca 960 

taacctctgt gagattgata aataccgtgt agaaacgtgc tgtgtaattg gcaattatta 1020 

cagtttacgt tctcagcatg agaaagcagc cttatatttc cagagagccc tgaaattaaa 1080 

tcctcggtat cttggtgcct ggacactaat gggacatgag tacatggaga tgaagaacac 1140 

gtctgctgct atccaggctt atagacatgc cattgaggtc aacaaacggg actacagagc 1200 

ttggtatggc ctcgggcaga cctatgaaat ccttaagatg ccattttact gcctttatta 1260 

ttatagacgg gcccaccagc ttcgacccaa tgattctcgc atgctggttg ctttaggaga 1320 

atgttacgag aaactcaatc aactagtgga agccaaaaag tgttattgga gagcttacgc 1380 

cgtgggagat gtggagaaaa tggctctggt gaaactggca aagcttcatg aacagttgac 1440 

tgagtcagaa caggctgccc agtgttacat caaatatatc caagatatct attcctgtgg 1500 

ggaaatagta gaacacttgg aggaaagcac tgcctttcgc tatctggccc agtactattt 1560 

taagtgcaaa ctgtgggatg aagcttcaac ttgtgcacaa aagtgttgtg catttaatga 1620 

tacccgggaa gaaggtaagg ccttactccg gcaaatccta cagcttcgga accaaggcga 1680 

gactcctacc accgaggtgc ctgctccctt tttcctacct gcttcactct ctgctaacaa 1740 

tacccccaca cgcagagttt ctccactcaa cttgtcttct gtcacgccat agttggctac 1800 

tctcaagcca gcacattgtt agacccatct taattaagcc ttacctccat gtaaagaaca 18 60 

gcacgtctgt tccaaggacc tcagctcttc ttgtttctac agatggcaac agctccatag 1920 
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ggacagcttg tataattacc ttcagaggcc aactgacaga atcctggcag gaacagacat 1980 

tatcttgcca gttagaagta cttctgtctc acttatgtcc aaagagtggc tatagatctt 2040 

ggccttcttc cctgaatgct tttttttttt ggcccccaag aaagtccctt ttatagcact 2100 

ttagcacagg caatgctaca ggaacaaagt ttcaatgctg ctgagagtga aagaaaggag 2160 

gaaagtctgc cactctaccc tgagctggca gtagggcact gagtaccctt aggaagaagt 2220 

cagagcaatg gatacaaatg accttgctct tggatttgct gagcatgatc cctattctga 2280 

tgtcagagat taggtttaaa tggaatagag ctatccattt gttcttactc tctagggaga 2340 

caatcttcca aaacagtttt gggggggtct tctaaagctt tcaaattgga agtaacttta 2400 

ttcaactaga gttgaataaa agaagggcaa aaataatctc acagagcttg gaactgctga 24 60 

tagcccttac tgagggcaaa agatggctat attgttagct atactcctac caaagcaagc 2520 

aaggagatag gattatagat aatttcacgg acatttggaa ataacattgg tgattataca 2580 

gacaagaata aactcacttc aagctggtct gttttaataa attttcaacg taattgtcta 2640 

tttttttccc tcccatctgc aacagaatac atttttttca gcctttatct agatgaggta 2700 

aagggaatca ttcttatggt gctcttggag agtttcaggc ctgtgcatgt gtgtacagca 2760 

ggaggtaata tgctataatg tctgctgtaa tatatttgca cagtagatgc tatggatcat 2820 

tctgagctca gggtccagac tttattctta ttcccagaat tttgtgttac gtttttacct 2880 

cctaacatat gacacttcat cttatattaa ggaaggttta gaatatctaa tacgacttga 2940 

attcatttgt tactaagcct tctcaggcaa gctgtatact agttactggt ctccactgcc 3000 

atgccttttc aaggttccca tggtccagaa tgatgtttga ttcttaattt ttctgtccct 3060 

tttataattt gttttaatga ttttgctaca tttggaattc aataaaaaat gtgaacaata 3120 

ataaaaaaaa aaaaaaaaaa 3140 

<210> 69 

<211> 3513 

<212> DNA 

<213> Homo sapiens 

<400> 69 

ccgtgtacca ggtgctgcta gtgggaagca cgctgctgaa ggaagtgcct tccgggctgc 60 

agctggagca gttgccttct cagagcctgc tgacccacat cccaacggcg gggctgccca 120 

cttcgctagg aggaggcctg ccttactgcc accaggcctg gctggatttc cgaaggcggc 180 

tggaagctct actacagaac tgccaggcag cttgtgccct gctccagggg gccatcgaaa 240 

gtgtgaaggc tgtgccccag cccatggagc ctggggaggt cggtcagctg ctacagcaga 300 

cagaggtcct gatgcagcag gtgctagact cgccatggct ggcatggcta caatgccagg 360 

ggggccggga gctgacatgg ctgaagcaag aggtcccaga ggtgaccctg agcccagact 420 

acaggacggc aatggacaag gctgacgagc tatatgaccg ggtggatgga ttgctgcacc 4 80 

aactgaccct gcagagcaac cagcgaatac aggccctaga gttggtccaa acactggagg 540 

cccgggaaag cggactgcac cagattgaag tgtggctgca gcaggtgggc tggccagcac 600 

tggagg.aggc tggggagccc tcgctggaca tgctgctcca ggcccaaggc tcttttcagg 660 

agctgtacca ggttgcccag gagcaggtca ggcaagggga gaagtttctg cagccgctga 720 

ctggctggga ggcggctgaa ctggaccccc ctggggcacg ctttctggcc ctgcgagccc 780 

agctgactga attctctagg gctttggccc agcggtgcca gcggctggcg gatgctgaga 840 

ggctgtttca gctcttcagg gaggccttga cgtgggctga ggaggggcag cgagtgttgg 900 

cagagctgga gcaggaacgc ccgggggttg tgttgcagca gctgcagctg cactggacca 960 

ggcaccctga cttgcctcct gcccacttcc gaaagatgtg ggctctggcc acggggctgg 1020 

gctcagaggc catccgccag gagtgccgct gggcctgggc gcggtgccag gacacctggc 1080 

tggccctgga ccaaaagctt gaggcttcac tgaagctacc accggtgggc agcacagcta 114 0 

gcctgtgtgt cagccaggtc cccgctgcac ctgcccaccc tcccctgagg aaggcctaca 1200 

gcttcgatcg gaatctgggg cagagtctca gtgaacctgc ctgccactgc caccatgcgg 1260 

ccactattgc tgcctgccgc agaccagagg ctggaggagg tgccctgccc caggcatccc 1320 

ctactgtgcc tccaccaggc agctctgacc ccaggagcct caacaggcta cagctggtgc 1380 

tggcagagat ggtggccacg gagcgggagt atgtccgggc tctagagtac actatggaga 14 40 

actatttccc cgagctggat cgccccgatg tgccccaggg cctccgcggt cagcgtgccc 1500 

acctctttgg caacctggag aagctgcggg acttccactg ccacttcttc ctgcgtgagc 1560 

tggaggcttg cacccggcac ccaccacgag tggcctatgc cttcctgcgc catagggtgc 1620 

agtttgggat gtacgcgctc tacagcaaga ataagcctcg ctccgatgcc ctgatgtcaa 1680 

gctatgggca caccttcttc aaggacaagc agcaagcact gggggaccac ctggacctgg 174 0 
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cctcctacct 
tggcacgggc 
ttgtgcactt 
gtgatgttaa 
ctgggcgcca 
agcctcgcca 
cagaccttgg 
gccgccgcaa 
cctggacagc 
gcatggctga 
gcgaggaagc 
ctcgggcgtc 
actcccttcc 
acagagaccc 
cagcactgga 
agatctcgtc 
cagggcaggc 
cgagcagtag 
accagggtgt 
cctttctgat 
ttgtagagcc 
ccagttgtgg 
tgggggacca 
ggtgggctcc 
gggacaggtc 
cctttttact 
aattgtctaa 
gtctttgatg 
aaaaaaaaaa 



gctaaagccc 
ctgcgggggc 
ccagctgcgg 
cctcaaggaa 
caagtccgtg 
tgggcccaca 
tctcactgag 
ggccagggac 
tgacatctcc 
gatggtgtcc 
catcaacgac 
cattgccgta 
tggagaccct 
agctcttctg 
ggctgaggca 
ccaatgccca 
cctgggtagg 
atccagcagc 
ggctgacacc 
gtgtgtggcc 
tgaataggct 
gttaagaata 
tgtgcctcta 
tagcagctga 
cacctctact 
ggggcagttt 
aattatctta 
tgtgatttaa 



atccagcgca 
cccacgcagg 
cacggaaacg 
caggggcagc 
cgccgcatct 
ggggttgaca 
tgctgtggga 
acctttgtgc 
cacctgcttt 
atgggtgtgg 
cgcaccgtca 
gccccgtttg 
gcctcttgct 
ggtctccgct 
gagctgggcg 
tcagccagtg 
ggcctggagg 
ctgcagctcc 
tgggctacct 
attggactaa 
cctggcccca 
ggctagagca 
ggcagtgact 
tccccaatgc 
gggccctcat 
cgttattttg 
ctggttgtta 
tcttttattt 



tgggcaagta 
agctcagtgc 
acctgctggc 
tggtgcgaca 
tcctttttga 
catttgccta 
acagcaacct 
tgcaggcctc 
ggaggcaggc 
ggaacaaggc 
actatgtcct 
accatgacag 
ctgttctggg 
gtcccctgta 
gccagccctc 
gctccagtgg 
acttaccctg 
aaggaacatt 
ccaacctaca 
ctggcacggg 
tgaccccttc 
gacattgggt 
agggtgcccc 
ctggccttaa 
gcttgccttt 
acttgatgcc 
ggcctttggt 
gtttataata 



cgcactgctg 
gctgcgggag 
catggacgcc 
ggatgagttt 
ggagctgctg 
caagcgctcc 
gcgcttcgag 
cagcctggct 
cgtccacaac 
cttccgagac 
gaagtgccga 
cctctacctg 
gtccctcaac 
tcccagcttc 
tttgactgct 
ctctgacagc 
tgtctgagcc 
gcctctctgg 
tgtgcaacgc 
gcctctctag 
tcctgtcccc 
gtttccatgc 
cacccctcag 
agccgagctc 
cctggccccc 
ttttgaataa 
gtctcagaga 
aaaaatagac 



aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



ctgcaggagc 
gcccagagcc 
atccagggct 
gtggtgcgca 
ctcttcagca 
ttcaagatgg 
atctggttcc 
atcaagcagg 
aaggaggtgc 
attgctccca 
gaagttcgct 
ggggcctcga 
ctgcacctgt 
ctagaggaag 
gaggactcag 
agctgtgtgt 
cgggactgga 
atctgctgtg 
tgttgactac 
ggaagtctgg 
agctcccatc 
tgtaggctgg 
gaagaacaca 
agttaccata 
aggcccagcc 
ctttcaatag 
aggagtctag 
tgatttgtaa 
aaaaaaaaaa 



aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3513 



<210> 70 
<211> 3597 
<212> DNA 

<213> Homo sapiens 



<400> 70 

catgccagtt 

atttgggaac 

tgatccctgc 

agtgagcttg 

caaagtttga 

aaaagtctct 

gatgtactgg 

ccgttgcatg 

ctcattccac 

gagtacagga 

taccccaccc 

gggtccccat 

atccttacca 

ccctctgtga 

ggacacagta 

aaggacaaac 

gtaggaaggt 

ctccttccag 

gggtctttcc 

ccaggcagct 



acttcctcag 
agtggataac 
acccaaagcc 
gctgcttgtg 
ggcagactgt 
gacctggtcc 
cccagcatct 
tctggcttct 
cgtgtctctc 
gctgctcaga 
aaagctgcct 
ccgaggatcc 
ctgtttccag 
cctaaaaaga 
atgtgaagtc 
ctcccacctc 
gctctgcagg 
gaacagtgag 
atgactgagt 
ccgtgccctg 



gaaaatattt 
ccaagtgtcc 
agcccctaaa 
aggagctaca 
ccaggactgt 
cctcaggccc 
ccacctcccc 
gcctgtgctc 
ctgcacacat 
cctctcagcc 
tcctggctgt 
acaagcaatg 
agccgtgaac 
aaagctcaat 
aaatacttaa 
acagacagcc 
cacaaggccc 
cccagtgagc 
cacctccagg 
gcttctcccc 



tcttgccttc 
cacaggccaa 
gcctacccct 
agtgaaagag 
tcccaggaag 
agctacctgc 
catgcaacca 
ctcctgccac 
agcctctgtc 
cagccctctg 
aggagctccc 
acttcccaaa 
gatgctgtga 
ttccatctgt 
ccgagcaaag 
aagcagtgag 
agagaagccc 
agtcccagcc 
agctcacctg 
atgccccaaa 



ttctttcagt 
ggtatattcc 
tgtgcacccg 
aagttatttt 
aagcaggagt 
gcccaccagc 
ggtccctggt 
gagcatcctc 
ccagggcgat 
tgactgcccc 
tcgtctagcc 
tgacctccac 
tggcccaggt 
cttctttccc 
ggccagtgtt 
ggaaagacag 
ctctccggga 
agctcttcaa 
acccccagag 
tcccccccag 



atggttttaa 
aatggcagca 
cagcctggta 
aaataaatcc 
tacccacagg 
agtgaaggtt 
accgtgtctc 
cctgtccctc 
ttatccactt 
agccccatcc 
aaggccctat 
tgcaagaaga 
ctcagcacca 
aggaccaagg 
gttatcagtc 
acagacatag 
acttcccctg 
ggccttcaag 
aagacctacc 
ccatccctcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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tggtcctcgt 
ctgggagacc 
gggccattta 
aaagtaagaa 
aaacctatga 
tgaaaaaaaa 
tcacatggct 
ctaccgcctc 
cttatagagt 
gctgggacac 
acctaaatga 
gcaagagaga 
tttattagag 
tacatactca 
aagcaaagtt 
attggcagag 
ctttattcca 
ctgccgaatc 
gatgggtcat 
ggtctatgca 
tatctttgat 
aaaacagagg 
accgaacagg 
cgaagatggg 
tagttagctt 
aagaatcttg 
ctcaccactc 
gttggctgct 
tgccactgcc 
cctgctgagt 
gagccgacgt 
ttcaaatgct 
tctttgagtc 
acccccttcc 
ccctttgtta 
tgtatgtgga 
tcctagtaaa 
aattggagcc 
gtatacctgt 
tatttgtaat 



ctacatcaag 
ttgaaccctc 
tttgggattt 
gggagccctg 
cttccaaggg 
aaaatgggtc 
gctgctattc 
ctgccacctc 
ggggagctga 
tctgcaaacc 
ccactcctct 
gttttttttt 
ggagtatcca 
ttaggatccc 
aaacaaaacg 
gtagtgaggt 
ttcccacccg 
ctacacatct 
gggaagaccc 
gctcttcttc 
cttttgactc 
atacagttaa 
actaaggaac 
atacacgtgc 
gttgacttgt 
tttccattcg 
cctaccttgg 
ttgagcagga 
ctcctcccca 
taccatagca 
gtggggatga 
agagaagggg 
acatagagaa 
agccccatcc 
gaacaaagca 
tgagatctct 
atattaaaaa 
cagttgtaac 
gtatgtaaat 
atagaataaa 



ggcctcttcc 
agcctcttcc 
tggagttgtt 
ctatggatgt 
gatgaggaga 
aggacattcc 
ctggtgcttc 
accctacagg 
gacacagcct 
tctcctcatg 
catttggaaa 
ttttttacta 
agttttaaaa 
gattaaattc 
aaacaaaatc 
gattcaggta 
ctttgatgtc 
cttatcagct 
aagctttccc 
caacaatttc 
aagcacattt 
agagcaaccc 
actttcgtgc 
tcatctgtct 
taaaccttct 
gaaaccaacg 
cctggtgtcc 
acgaggcctc 
ctttagcatg 
accagcaact 
tgcaggggtt 
agaaacagga 
gccgttgaag 
cacttctgtt 
aagcatatct 
aagaagaggg 
aaaattaaaa 
ttggtaaagg 
ataaggcatt 
gtttattaaa 



cctcttcctc 
tttaaaaaaa 
tggtttttgt 
taagtccaaa 
ggttcagagg 
ctggatgagg 
cccacacttt 
cattctctat 
caggtaacac 
gtgctaaggg 
gtaatccact 
caaatttttg 
gtatatagaa 
cttgagtaga 
cttcatatac 
ggctgaaaat 
tatgcttggc 
ttcctcaaac 
tccgccagga 
tttatttgga 
tggaagggct 
aaaggacgct 
acagaagtca 
gtcctccttt 
gttcttaaaa 
attttgtgtt 
ttgagacata 
caggccctga 
taggggccag 
ccagggtacc 
ttggcccagc 
tggaagggtg 
gaggtagggc 
tcaaactaaa 
ttaaacaaca 
tcttctggtt 
agtttttaaa 
caagcttctg 
cctattttgc 
aaataagaaa 



ccagctctca 
acaaaacaaa 
ttttatatct 
ttactcggtt 
acaggaggag 
acaatgctag 
tgacagatgg 
gtaggaaaca 
tgacacagct 
tggcatgctc 
gcagtaaaag 
ctcccccata 
ttttttggtt 
ctggtgccta 
aaaaagaact 
cctgggttgc 
tctctgggct 
tttaaggagg 
ttgcaaaagc 
attagaactt 
cccttacaaa 
taagaaaccg 
gccgcaatcc 
cctctccctc 
tgaaaagcta 
ttagaatgga 
cggtctttgc 
ggtgggaagg 
cccatctctt 
acaacagaca 
cagaggaccc 
gtttaaggaa 
aggttatctc 
gctcccacct 
gtgttaaaat 
ttgattttta 
aaggaaacct 
tacctttgtt 
agttcagaac 
aaaaaaaaaa 



ggacaggtga 
acaaaactgt 
taatagttcg 
agtgggagca 
cctcccccat 
gggtggcatc 
agtccttctc 
agagccttat 
cccgaatgag 
ttgacaggaa 
tttcagacat 
aaattatttt 
gtaagagaaa 
ccagaaagca 
ttctgtttgt 
gggagcctca 
gcccctggta 
ctctgtgagg 
aagtagactt 
cctttgttag 
agtagaattt 
agaccacttc 
aggcacagga 
cccgacgttc 
gcttacctca 
cagccctccc 
ttagtcgtgt 
aaggattgga 
ccagcagggt 
atggctcagc 
agagttgagc 
ctggcagggg 
tgttccagtc 
cgaacattga 
gagcctcaaa 
aagaagagta 
atgctattta 
ataattaatt 
aaaaaaaact 
aaaaaaa 



<210> 71 

<211> 855 

<212> DNA 

<213> Homo sapiens 



<400> 71 

cgctcaatta 

ggatagcaac 

catgccacgt 

gcagggagga 

cgcacgcagt 

ttgcacacgg 

atttccatga 

tacaggaagt 

agtcgtctcc 

tggcacgcct 



tctactcgag 
ggcgtccctc 
ggtagtctct 
taggacgggc 
agatgttgtt 
agcaccggaa 
tgagatgtcc 
cctcctggca 
cacacgagtc 
gctgggctcc 



tctagactcg 
ctccccgctc 
gtccatggac 
acaggaggcg 
ctccacgtcc 
gcagcctggg 
acacacgctg 
gtacactttc 
gcaggtgaag 
gtagatgcca 



aggcggccgc 
agctgcagcc 
accacacgga 
cattttggtg 
acggtgaagg 
tggtaggact 
catttgtcgg 
tcacccacgt 
cagtcagtgt 
agcccacact 



ccattgtgca 
cgcagtcctc 
tggttgtctc 
caaaaaccgt 
gaaccccgtc 
tgcccagggc 
ccgtctgctg 
tgtagaacgc 
gataaagact 
tgatgcaaat 



ctaaagcagg 
acagtggtaa 
gcagccctgt 
gtgatagtct 
caggcactca 
ctgcaggatc 
gaacccggag 
cttcccacgg 
ccccattgcc 
gccgaagtag 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3597 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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tcccgcgccg tgcgcgcctc gagcgcccgc tccagctccc gggtgagcgc ctccagccgc 
cgctcggccg cgcttgggcc gccctcccgg ccagggggca gcgggagtgc aggcagcggg 
aagggggccg gccccgcagg ctccggggag cgagcgggcg cggggcaggc gccgggcggg 
aggaagtcag cgtaggcttt agtgcacaat gggaattcgg atccggtcga cactagttct 
aggatctctt ttttt 

<210> 72 

<211> 3791 

<212> DNA 

<213> Homo sapiens 



660 
720 
780 
840 
855 



<400> 72 

acagacggcg 

tggttgaaga 

ggaaaaccat 

acctctggaa 

ctgcaggaag 

ttatcttcag 

gatgagcagc 

gcagtagacc 

gggcagcaac 

atatctttaa 

atgggagaca 

ccgtcccatc 

aaggaaagcc 

aacgccttct 

ttccgtgtca 

agagaaatga 

tcaagtgtgg 

caagaagctc 

tcttggatgt 

aatctccgcc 

cctttctcta 

acaaacacat 

ctcctgtgtg 

ctgctgagct 

cagctggctg 

agtgggataa 

cagcaggcca 

cttcacctat 

cgtgcagaaa 

cagagcctca 

aaggatgcgg 

cagctggaac 

acccaactgg 

acttctacct 

ttgctgagtc 

caggaacgtg 

tctgcccagt 

ctagcaacag 

gagctacaga 

tgccagctta 

ctaaaacaca 

gacctgaaag 

ggtcaggttg 

tgtgagaacc 

gataaccagg 



ggtgaacatg 
tgtggcgagt 
ctatgagaac 
aaagatcccc 
gcagcaacaa 
aacatttcag 
ctctagatcc 
cactgggcaa 
aagacatgat 
atggaccttt 
ggttttcaga 
tcttagagga 
tgagcagtag 
tgccttcctc 
atcatgtgga 
ggtttcccac 
aagatattct 
caggcccagc 
ccccactggc 
aaagcttatc 
cttgctcggt 
cccagacagg 
gccggcctcc 
ctcttgtcat 
tccctcaccc 
ctaataaact 
gaaatgtcat 
ccctgttgca 
cattggtctg 
aagcagaaag 
cagagatagt 
aggacctagc 
tagggcttca 
tgcaacaaga 
ggtcccgaca 
atgtggcaat 
tagaggagtg 
atctccgggc 
gtcagcatac 
cccagagcaa 
tgcaggcaga 
agaccttgga 
agtgtcaatt 
tcaaggacac 
agcaagatct 



gcgtcctcga 
gaaaaaactg 
tcctctccgt 
cgcttgctcc 
ctcgtctcca 
tcattcctca 
aattccccaa 
ttatatggtt 
atttgaggcc 
gagaacagac 
agttgctgct 
gtctccacca 
aactgaggct 
tgttctctgg 
cccagaggag 
acatcctaag 
gtccacatgc 
agtagaagat 
ctggctggaa 
ccttccctcg 
ggggacttgg 
cctggttggc 
agatctgact 
tgtggagttt 
agaaacccag 
tcagcatctt 
gcaatcatgg 
tttagaagaa 
ttgctgtttt 
ggaggaggca 
gttggaggct 
atccatgcgg 
tgccaagcaa 
ctggaggtcc 
actcacagag 
tgaggaaaag 
caaaggccaa 
tcagttgcag 
ccattgtgcc 
tgaggagcag 
actgcagcag 
gtttgcagac 
gaaaaccaca 
tgtagagaac 
ggagaaaaca 



cttggtctga 
agcctcagcc 
gaacttaccc 
tcgctgaccc 
gtggattttg 
aagtggctag 
attagctcta 
aaaaccatcg 
cgtttagata 
gatctggtga 
gtatctgaga 
aatccctgtt 
gtgcgtgagg 
ctttcccctt 
gaaattgtag 
gagtctgaaa 
ctgacaccaa 
gttggtagga 
aaaggtgtaa 
atgcttcggg 
tttactcctt 
accaagcaca 
gccttgtctc 
ctctcccgcc 
gacagtagca 
aaggagagcc 
gtgcttatct 
gataagacta 
gatttgctga 
aggcacagag 
ttctgtgcac 
gaattcagag 
gaagagctgg 
atgcaactgg 
aaactcacag 
caggaggttt 
acagaacaac 
attctggcca 
caggacctgg 
gctgctcaat 
caacaagctg 
caggagaatc 
ctggaagtgc 
ctaacggcta 
cggcagtact 



gacgtgatag 
tgtcgccttc 
tgcagcccgg 
catcactgtg 
taaataacaa 
aaacttgtca 
ctcctaaaac 
tccttgtacc 
ccatggcaga 
gagaggaggt 
aacctatctt 
ctgaacaact 
acttagtacc 
caactgcctt 
agcatggagc 
cagaagatca 
atctagtaga 
ttcttggctc 
atacctccgt 
atgctgcaat 
cagcaccaca 
gtacttctga 
gacatgactt 
agcttcggga 
cacagactga 
atgagatggg 
ctaaagagct 
ctgtgaatca 
agaaattgag 
aggaaatggc 
acgccagcca 
gccttctgaa 
ttcagcagac 
attatacaac 
tcaagagcca 
ctagggtgct 
tggagttgga 
acatggacag 
ctatgaagga 
gcgtaaagga 
tcctggccaa 
aggttgctca 
tccgggagcg 
aactggccag 
ctcaaaagct 



gcctgccttc 
gccccagacg 
tgccctcacc 
caagctgggg 
gaggacagac 
gcatgaatca 
gtctgaggaa 
atctccactg 
gacaaacagc 
ggcaccctgc 
tcaggaatct 
acattgctcc 
ttctgaaagt 
ggcagcagat 
tatggaggaa 
agcacttgtc 
aatggaatcc 
tgatacagag 
catgctggaa 
tggcactacc 
ggaaaagagt 
gacagagcag 
ggaagataac 
ctggaagagc 
cacatctcac 
acaggcccta 
gatatccttg 
ggagtctcgg 
ggcaaagctc 
tctcagaggc 
gcgcatcagc 
ggatgcccag 
agtgagtctt 
atggacagct 
gcaagccctg 
ggaacaagtc 
aaacattcgt 
ccagctaaaa 
tgagttactc 
agagatggca 
agaggtgcgg 
cctggagctg 
cagcttgcag 
caccatagca 
agggctgctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
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actgagcaac 
caagagaccc 
gacaggacct 
cctgtgccct 
cttcagcccg 
actgagcttc 
ctgcagcgaa 
gaagtccaga 
tacaagaatg 
atagacaaga 
cttcggcgtg 
ccaactgcca 
tggacaaact 
gttccagagc 
aaaaactaga 
tgaggggatg 
agaatttgtt 
ggcatagagc 
aaaaaaaaaa 



tacagagcct 
ttctgctgag 
tcctgggaag 
tgcttggaag 
cagagacccc 
agagtctttg 
aaatttgtga 
aggcaaaaga 
aaaaggagct 
gtggcgagct 
cggagacaga 
gcctatggcc 
gagagtgatg 
ccatagaaat 
tgacattgtt 
caaagaacta 
tcacctcttt 
caactgagat 



gactctcttt 
tacagcctgt 
catcttgaca 
tgacaagagt 
aggcatggag 
ttccctgcta 
gctgcaagct 
agcagacata 
ccaggaagtg 
cataagcctt 
gaccaaagtg 
accaattgga 
ttcctggaga 
atcctagagg 
cagcatattt 
cagggattgc 
ttacctgcaa 
aaatgctatt 



ctacagacaa 
cctcccaccc 
gcagtggcag 
gctttcaccc 
gagagcctgg 
caagagtcta 
aggctgcagg 
gagaagctga 
atacagcaga 
agagaggagg 
ctccaggagg 
tccaggagaa 
tgaaaaatga 
agaaccttcg 
ataagaccct 
tggaatttct 
taccccctta 
taaataaagt 



aactaaagga 
aggaacaccc 
atgaagagcc 
gagtagcatc 
cagaaatgag 
aagaagaagc 
cccaggaaga 
accaggcctt 
atgagaagat 
tgacccacct 
cctggcaggc 
agtgtggctc 
gaaggaaaac 
gcgctctgac 
gctctctatt 
gagctaagaa 
ccccaatacc 
gtatttaatg 



gaagactgaa 
tctgcctaat 
agaatcaact 
aatggtttcc 
tattatgact 
catcaggact 
acagcatcag 
gtgcttgcgc 
cctagaacag 
tacccgctca 
cagctggact 
tctcaggagg 
tcctgatcaa 
aaggagttag 
ccagaggtgg 
actgaaagcc 
aagaccaact 
aaaaaaaaaa 



<210> 73 

<211> 1683 

<212> DNA 

<213> Homo sapiens 



<400> .73 

ctctgagtgt 

cttgacctat 

gaggcagctg 

cctatcccga 

gggacagagt 

gctgctacct 

tgtccccctc 

gccagctcca 

ctgccaacag 

gcggccttct 

tggtccctgg 

caatttcctg 

agagtgtgcc 

ggtcaagacc 

ccaggaggaa 

tatttcctcg 

gaacatctgc 

cctacaaagg 

ccaggggaac 

tgaccgggag 

tactcgggat 

gaccattgac 

agttctcacc 

tgacctgcca 

tgatctgtgt 

tcctggggat 

gtctggagac 

tatcagctcc 

atg 



ccagtggtca 
ttggctgttg 
aggccagagc 
agcctcccca 
caagtgcagc 
gcccaactcc 
caaaaagagc 
tttggggacc 
gaccggtccc 
ccagacaatc 
aacctaccac 
gagattggat 
aaacttgtgt 
cgaccccact 
gctccccagc 
ggtcttgagc 
cacctgaggc 
gagctgctgg 
aatcacacct 
tatgctgtga 
gagtgctttg 
atcagtcgag 
aagcataaac 
tttccagaac 
ggtccccgac 
gaacaggtca 
actgagaacg 
acctctgagc 



gttgccccag 
cttctgctgc 
actttcaaga 
tggatcaccc 
cccctccctc 
ctgctgaaaa 
tgccctctct 
agagccatcc 
aagggggctg 
tgaaccaaat 
agtccagcta 
attcccgctg 
gggaggaagc 
ggtgctgcac 
cacactacca 
tgcctttccc 
gcttccgctc 
cactgatcca 
gtacatggaa 
agacccacca 
cccgtcgggc 
tcacccccaa 
atattcctgg 
aggcctgctg 
gtaacatctg 
actgcttcaa 
ccaagggcca 
ccaaggaaga 



gatggggacc 
ctctgaggga 
agttggctac 
tgactcctct 
tcaggaggcc 
ggaagtgggt 
ccagcacccc 
agaacctgag 
gggccaccgg 
ctgccttcct 
ctcccacctc 
ctgccactgc 
aatgagccga 
gcggcagggg 
gctccgggcc 
tcctggggtg 
tgtgccacgc 
gctggagagg 
ggcctgggag 
ccacttgtgt 
tccttacccc 
cctcatgggc 
gctgatccac 
tgcagaggag 
gcgagaccct 
catcaattat 
gggggagcag 
atgagtcacc 



acagccagag 
ggcttcacgg 
gcagctcccc 
cagcatggcc 
acccctctcc 
ccccctctcc 
aatgaacaga 
tcctggaatg 
ctggatggct 
aaccgtcagc 
actcgccagg 
cgcagccaca 
ttctgtgagg 
gaggctcggt 
tgccccagcc 
cccacattgg 
aacctgccag 
gagttccagc 
gatacccttg 
tgccgccacc 
aactatgacc 
cacctctgtg 
aacatgactg 
gagaaattaa 
gccctctgct 
ctgaggaacg 
ggctcaactg 
ccagagccct 



cagccttggt 
ctacaggaca 
cctccccacc 
ctccctttga 
aacaggaaaa 
ctcaggaagc 
aggaaggaac 
cagcccagca 
tcccccctgg 
atgtggtata 
gtgagaccct 
caaaccgcct 
ccgagttctc 
tctcctgctt 
atcagcctga 
acaatatcaa 
ctactgaccc 
gctgctgccg 
acaaatactg 
ctcccagccc 
gggacatctt 
gaaaccaaag 
cccgctgctg 
ccttcatcaa 
gttacctgag 
tggctctagt 
gaggaacaaa 
agagggtcag 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3791 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1683 



<210> 74 
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<211> 1696 

<212> DNA 

<213> Homo sapiens 

<400> 74 

cacctaaaag ccaaaatggg aaaggaaaag actcatatca acattgtcgt cattggacac 60 

gtagattcgg gcaagtccac cactactggc catctgatct ataaatgcgg tggcatcgac 120 

aaaagaacca ttgaaaaatt tgagaaggag gctgctgaga tgggaaaggg ctccttcaag 180 

tatgcctggg tcttggataa actgaaagct gagcgtgaac gtggtatcac cattgatatc 240 

tccttgtgga aatttgagac cagcaagtac tatgtgacta tcattgatgc cccaggacac 300 

agagacttta tcaaaaacat gattacaggg acatctcagg ctgactgtgc tgtcctgatt 360 

gttgctgctg gtgttggtga atttgaagct ggtatctcca agaatgggca gacccgagag 420 

catgcccttc tggcttacac actgggtgtg aaacaactaa ttgtcggtgt taacaaaatg 480 

gattccactg agccacccta cagccagaag agatatgagg aaattgttaa ggaagtcagc 540 

acttacatta agaaaattgg ctacaacccc gacacagtag catttgtgcc aatttctggt 600 

tggaatggtg acaacatgct ggagccaagt gctaacatgc cttggttcaa gggatggaaa 660 

gtcacccgta aggatggcaa tgccagtgga accacgctgc ttgaggctgt ggactgcatc 720 

ctaccaccaa ctcgtccaac tgacaagccc ttgcgcctgc ctctccagga tgtctacaaa 780 

attggtggta ttggtactgt tcctgttggc cgagtggaga ctggtgttct caaacccggt 840 

atggtggtca cctttgctcc agtcaacgtt acaacggaag taaaatctgt cgaaatgcac 900 

catgaagctt tgagtgaagc tcttcctggg gacaatgtgg gcttcaatgt caagaatgtg 960 

tctgtcaagg atgttcgtcg tggcaacgtt gctggtgaca gcaaaaatga cccaccaatg 1020 

gaagcagctg gcttcactgc tcaggtgatt atcctgaacc atccaggcca aataagcgcc 1080 

ggctatgccc ctgtattgga ttgccacacg gctcacattg catgcaagtt tgctgagctg 1140 

aaggaaaaga ttgatcgccg ttctggtaaa aagctggaag atggccctaa attcttgaag 1200 

tctggtgatg ctgccattgt tgatatggtt cctggcaagc ccatgtgtgt tgagagcttc 1260 

tcagactatc cacctttggg tcgctttgct gttcgtgata tgagacagac agttgcggtg 1320 

ggtgtcatca aagcagtgga caagaaggct gctggagctg gcaaggtcac caagtctgcc 1380 

cagaaagctc agaaggctaa atgaatatta tccctaatac ctgccacccc actcttaatc 1440 

agtggtggaa gaacggtctc agaactgttt gtttcaattg gccatttaag tttagtagta 1500 

aaagactggt taatgataac aatgcatcgt aaaaccttca gaaggaaagg agaatgtttt 1560 

gtggaccact ttggttttct tttttgcgtg tggcagtttt aagttattag tttttaaaat 1620 

cagtactttt taatggaaac aacttgacca aaaatttgtc acagaatttt gagacccatt 1680 

aaaaaagtta aatgag 1696 

<210> 75 

<211> 7680 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gaagagcaag aggcaggctc agcaaatggt tcagccccag tccccggtgg ctgtcagtca 60 

aagcaagccc ggttgttatg acaatggaaa acactatcag ataaatcaac agtgggagcg 120 

gacctaccta ggtaatgtgt tggtttgtac ttgttatgga ggaagccgag gttttaactg 180 

cgaaagtaaa cctgaagctg aagagacttg ctttgacaag tacactggga acacttaccg 240 

agtgggtgac acttatgagc gtcctaaaga ctccatgatc tgggactgta cctgcatcgg 300 

ggctgggcga gggagaataa gctgtaccat cgcaaaccgc tgccatgaag ggggtcagtc 360 

ctacaagatt ggtgacacct ggaggagacc acatgagact ggtggttaca tgttagagtg 420 

tgtgtgtctt ggtaatggaa aaggagaatg gacctgcaag cccatagctg agaagtgttt 480 

tgatcatgct gctgggactt cctatgtggt cggagaaacg tgggagaagc cctaccaagg 540 

ctggatgatg gtagattgta cttgcctggg agaaggcagc ggacgcatca cttgcacttc 600 

tagaaataga tgcaacgatc aggacacaag gacatcctat agaattggag acacctggag 660 

caagaaggat aatcgaggaa acctgctcca gtgcatctgc acaggcaacg gccgaggaga 720 

gtggaagtgt gagaggcaca cctctgtgca gaccacatcg agcggatctg gccccttcac 780 

cgatgttcgt gcagctgttt accaaccgca gcctcacccc cagcctcctc cctatggcca 840 

ctgtgtcaca gacagtggtg tggtctactc tgtggggatg cagtggttga agacacaagg 900 

aaataagcaa atgctttgca cgtgcctggg caacggagtc agctgccaag agacagctgt 960 
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aacccagact tacggtggca acttaaatgg agagccatgt gtcttaccat tcacctacaa 1020 

tggcaggacg ttctactcct gcaccacgga agggcgacag gacggacatc tttggtgcag 1080 

cacaacttcg aattatgagc aggaccagaa atactctttc tgcacagacc acactgtttt 1140 

ggttcagact caaggaggaa attccaatgg tgccttgtgc cacttcccct tcctatacaa 1200 

caaccacaat tacactgatt gcacttctga gggcagaaga gacaacatga agtggtgtgg 1260 

gaccacacag aactatgatg ccgaccagaa gtttgggttc tgccccatgg ctgcccacga 1320 

ggaaatctgc acaaccaatg aaggggtcat gtaccgcatt ggagatcagt gggataagca 1380 

gcatgacatg ggtcacatga tgaggtgcac gtgtgttggg aatggtcgtg gggaatggac 14 4 0 

atgcattgcc tactcgcaac ttcgagatca gtgcattgtt gatgacatca cttacaatgt 1500 

gaacgacaca ttccacaagc gtcatgaaga ggggcacatg ctgaactgta catgcttcgg 1560 

tcagggtcgg ggcaggtgga agtgtgatcc cgtcgaccaa tgccaggatt cagagactgg 1620 

gacgttttat caaattggag attcatggga gaagtatgtg catggtgtca gataccagtg 1680 

ctactgctat ggccgtggca ttggggagtg gcattgccaa cctttacaga cctatccaag 174 0 

ctcaagtggt cctgtcgaag tatttatcac tgagactccg agtcagccca actcccaccc 1800 

catccagtgg aatgcaccac agccatctca catttccaag tacattctca ggtggagacc 1860 

taaaaattct gtaggccgtt ggaaggaagc taccatacca ggccacttaa actcctacac 1920 

catcaaaggc ctgaagcctg gtgtggtata cgagggccag ctcatcagca tccagcagta 1980 

cggccaccaa gaagtgactc gctttgactt caccaccacc agcaccagca cacctgtgac 2040 

cagcaacacc gtgacaggag agacgactcc cttttctcct cttgtggcca cttctgaatc 2100 

tgtgaccgaa atcacagcca gtagctttgt ggtctcctgg gtctcagctt ccgacaccgt 2160 

gtcgggattc cgggtggaat atgagctgag tgaggaggga gatgagccac agtacctgga 2220 

tcttccaagc acagccactt ctgtgaacat ccctgacctg cttcctggcc gaaaatacat 2280 

tgtaaatgtc tatcagatat ctgaggatgg ggagcagagt ttgatcctgt ctacttcaca 2340 

aacaacagcg cctgatgccc ctcctgaccc gactgtggac caagttgatg acacctcaat 2400 

tgttgttcgc tggagcagac cccaggctcc catcacaggg tacagaatag tctattcgcc 24 60 

atcagtagaa ggtagcagca cagaactcaa ccttcctgaa actgcaaact ccgtcaccct 2520 

cagtgacttg caacctggtg ttcagtataa catcactatc tatgctgtgg aagaaaatca 2580 

agaaagtaca cctgttgtca ttcaacaaga aaccactggc accccacgct cagatacagt 264 0 

gccctctccc agggacctgc agtttgtgga agtgacagac gtgaaggtca ccatcatgtg 2700 

gacaccgcct gagagtgcag tgaccggcta ccgtgtggat gtgatccccg tcaacctgcc 27 60 

tggcgagcac gggcagaggc tgcccatcag caggaacacc tttgcagaag tcaccgggct 2820 

gtcccctggg gtcacctatt acttcaaagt ctttgcagtg agccatggga gggagagcaa 2880 

gcctctgact gctcaacaga caaccaaact ggatgctccc actaacctcc agtttgtcaa 2940 

tgaaactgat tctactgtcc tggtgagatg gactccacct cgggcccaga taacaggata 3000 

ccgactgacc gtgggcctta cccgaagagg ccagcccagg cagtacaatg tgggtccctc 3060 

tgtctccaag taccccctga ggaatctgca gcctgcatct gagtacaccg tatccctcgt 3120 

ggccataaag ggcaaccaag agagccccaa agccactgga gtctttacca cactgcagcc 3180 

tgggagctct attccacctt acaacaccga ggtgactgag accaccatcg tgatcacatg 3240 

gacgcctgct ccaagaattg gttttaagct gggtgtacga ccaagccagg gaggagaggc 3300 

accacgagaa gtgacttcag actcaggaag catcgttgtg tccggcttga ctccaggagt 3360 

agaatacgtc tacaccatcc aagtcctgag agatggacag gaaagagatg cgccaattgt 34 20 

aaacaaagtg gtgacaccat tgtctccacc aacaaacttg catctggagg caaaccctga 34 80 

cactggagtg ctcacagtct cctgggagag gagcaccacc ccagacatta ctggttatag 3540 

aattaccaca acccctacaa acggccagca gggaaattct ttggaagaag tggtccatgc 3600 

tgatcagagc tcctgcactt ttgataacct gagtcccggc ctggagtaca atgtcagtgt 3660 

ttacactgtc aaggatgaca aggaaagtgt ccctatctct gataccatca tcccagctgt 3720 

tcctcctccc actgacctgc gattcaccaa cattggtcca gacaccatgc gtgtcacctg 3780 

ggctccaccc ccatccattg atttaaccaa cttcctggtg cgttactcac ctgtgaaaaa 384/0 

tgaggaagat gttgcagagt tgtcaatttc tccttcagac aatgcagtgg tcttaacaaa 3900 

tctcctgcct ggtacagaat atgtagtgag tgtctccagt gtctacgaac aacatgagag 3960 

cacacctctt agaggaagac agaaaacagg tcttgattcc ccaactggca ttgacttttc 4020 

tgatattact gccaactctt ttactgtgca ctggattgct cctcgagcca ccatcactgg 4080 

ctacaggatc cgccatcatc ccgagcactt cagtgggaga cctcgagaag atcgggtgcc 4140 

ccactctcgg aattccatca ccctcaccaa cctcactcca ggcacagagt atgtggtcag 4200 

catcgttgct cttaatggca gagaggaaag tcccttattg attggccaac aatcaacagt 4260 

ttctgatgtt ccgagggacc tggaagttgt tgctgcgacc cccaccagcc tactgatcag 4 320 

ctgggatgct cctgctgtca cagtgagata ttacaggatc acttacggag aaacaggagg 4 380 
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aaatagccct 
ccttaaacct 
ccccgcaagc 
gatgcaagtg 
ccctgttact 
taaaactgca 
gtatgtggtt 
tgcagtaacc 
catcaaaatt 
gagccctgag 
agagctgcaa 
tgatatggag 
cctgaagttc 
tcagctcact 
aatcaacctt 
atatgaagtg 
tgtcaccact 
gaccaccatc 
tgccgttcca 
ctacaccatc 
tgacaatgct 
caacctgcgt 
tgccaggatt 
ggtccctcgg 
cgaatataca 
aaggaaaaag 
accagagatc 
gtatgacact 
gcaacaaatg 
ccccataagg 
gacaaccatc 
tgttggcact 
tctgacaggc 
gcagaggcat 
gaaccaacct 
agatgagtgg 
tggaagtggt 
caagattgga 
tcttgggaac 
tgggaagaca 
ctgcacatgc 
tgaacccagt 
tcagagaaca 
ggctgacaga 
ctctctgcca 
tttctttctt 
tctccaagca 
gcctgttctg 
tttggtttgg 
gatatgacca 
ctggcccgca 
cagtactcac 
ttatttatca 
atgcagttga 
gtgccaaagc 



gtccaggagt 
ggagttgatt 
agcaagccaa 
accgatgttc 
ggttacagag 
ggtccagatc 
agtgtctatg 
aacattgatc 
gcttgggaaa 
gatggaatcc 
ggcctcagac 
agccagcccc 
actcaggtca 
ggatatcgag 
gctcctgaca 
agtgtctatg 
ctggagaatg 
accattagct 
gccaatggcc 
acaggtttac 
cggagctccc 
ttcctggcca 
accggctaca 
ccccgccctg 
atttatgtca 
acagacgagc 
ttggatgttc 
ggaaatggta 
atctttgagg 
cataggccaa 
tcatgggccc 
gatgaagaac 
ctcaccagag 
aaggttcggg 
acggatgact 
gaacgaatgt 
catttcagat 
gagaagtggg 
ggaaaaggag 
taccacgtag 
tttggaggcc 
cccgaaggca 
aacactaatg 
gaagattccc 
agatccatct 
aagccctttg 
tcaccctggg 
cttcgaagta 
gatcaatagg 
aaattttaag 
atactgtagg 
tttttccaaa 
atttttccca 
taagaggaat 
tttactactg 



tcactgtgcc 
ataccatcac 
tttccattaa 
aggacaacag 
taaccaccac 
aaacagaaat 
ctcagaatcc 
gccctaaagg 
gcccacaggg 
atgagctatt 
cgggttctga 
tgattggaac 
cacccacaag 
tgcgggtgac 
gctcatccgt 
ctcttaagga 
tcagcccacc 
ggagaaccaa 
agactccaat 
aaccaggcac 
ctgtggtcat 
ccacacccaa 
tcatcaagta 
gtgtcacaga 
ttgccctgaa 
ttccccaact 
cttccacagt 
ttcagcttcc 
aacatggttt 
gaccataccc 
cattccagga 
ccttacagtt 
gtgccaccta 
aagaggttgt 
cgtgctttga 
ctgaatcagg 
gtgattcatc 
accgtcaggg 
aattcaagtg 
gagaacagtg 
agcggggctg 
ctactggcca 
ttaattgccc 
gagagtaaat 
aaactggagt 
ctctggagga 
agtttcctga 
ttcaataccg 
aaagcatatg 
taggaaagtc 
aacaagcatg 
tgatcctagt 
gtatttttat 
ttggtataat 
tggaaagaca 



tgggagcaag 
tgtgtatgct 
ttaccgaaca 
cattagtgtc 
tcccaaaaat 
gactattgaa 
aagcggagag 
actggcattc 
gcaagtttcc 
ccctgcacct 
gtacacagtc 
ccagtccaca 
cctgagcgcc 
ccccaaggag 
ggttgtatca 
cactttgaca 
aagaagggct 
gactgagacg 
ccagagaacc 
tgactacaag 
cgacgcctcc 
ttccttgctg 
tgagaagcct 
ggctactatt 
gaataatcag 
ggtaaccctt 
tcaaaagacc 
tggcacttct 
taggcggacc 
gccgaatgta 
cacttctgag 
cagggttcct 
caacatcata 
taccgtgggc 
cccctacaca 
ctttaaactg 
tagatggtgc 
agaaaatggc 
tgaccctcat 
gcagaaggaa 
gcgctgtgac 
gtcctacaac 
aattgagtgc 
catctttcca 
gatgttagca 
agttctccag 
gggttttctc 
ctcagtattt 
cagccaacca 
acccaaacac 
atcttgttac 
aattgcctag 
acggaaaaaa 
tatggtgggt 
actgttttaa 



tctacagcta 
gtcactggcc 
gaaattgaca 
aagtggctgc 
ggaccaggac 
ggcttgcagc 
agtcagcctc 
actgatgtgg 
aggtacaggg 
gatggtgaag 
agtgtggttg 
gctattcctg 
cagtggacac 
aagaccggac 
ggacttatgg 
agcagaccag 
cgtgtgacag 
atcactggct 
atcaagccag 
atctacctgt 
actgccattg 
gtatcatggc 
gggtctcctc 
actggcctgg 
aagagcgagc 
ccacacccca 
cctttcgtca 
ggtcagcaac 
acaccgccca 
ggacaagaag 
tacatcattt 
ggaacttcta 
gtggaggcac 
aactctgtca 
gtttcccatt 
ttgtgccagt 
catgacaatg 
cagatgatga 
gaggcaacgt 
tatctcggtg 
aaetgccgca 
cagtattctc 
ttcatgcctt 
atccagagga 
gacccagctt 
cttcagctca 
ataaatgagg 
taaatgaagt 
agatgcaaat 
ttctgctttc 
tgtgatattt 
aaatatcttt 
ttgtattgaa 
gattattttt 
taaaagattt 



ccatcagcgg 
gtggagacag 
aaccatccca 
cttcaagttc 
caacaaaaac 
ccacagtgga 
tggttcagac 
atgtcgattc 
tgacctactc 
aagacactgc 
ccttgcacga 
caccaactga 
cacccaatgt 
caatgaaaga 
tggccaccaa 
ctcagggtgt 
atgctactga 
tccaagttga 
atgtcagaag 
acaccttgaa 
atgcaccatc 
agccgccacg 
ccagagaagt 
aaccgggaac 
ccctgattgg 
atcttcatgg 
cccaccctgg 
ccagtgttgg 
caacggccac 
ctctctctca 
catgtcatcc 
ccagtgccac 
tgaaagacca 
acgaaggctt 
atgccgttgg 
gcttaggctt 
gtgtgaacta 
gctgcacatg 
gttacgatga 
ccatttgctc 
gacctggggg 
agagatacca 
tagatgtaca 
acaagcatgt 
agagttcttc 
actcacagct 
gctgcacatt 
gattctaaga 
gttttgaaat 
acttaagtgt 
taaatatcca 
ctcttacctg 
aacacttagt 
tatactgtat 
acattccaca 



4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
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<210> 76 

<211> 1316 

<212> DNA 

<213> Homo sapiens 



<400> 76 

tcctaatacg actcactata gggctcgagc ggccgcccgg gcaggtcgaa tgcaggcgac 60 

ttgcgagctg ggagcgattt aaaacgcttt ggattccccc ggcctgggtg gggagagcga 120 

gctgggtgcc ccctagattc cccgcccccg cacctcatga gccgaccctc ggctccatgg 180 

agcccggcaa ttatgccacc ttggatggag ccaaggatat cgaaggcttg ctgggagcgg 240 

gaggggggcg gaatctggtc gcccactccc ctctgaccag ccacccagcg gcgcctacgc 300 

tgatgcctgc tgtcaactat gcccccttgg atctgccagg ctcggcggag ccgccaaagc 360 

aatgccaccc atgccctggg gtgccccagg ggacgtcccc agctcccgtg ccttatggtt 420 

actttggagg cgggtactac tcctgccgag tgtcccggag ctcgctgaaa ccctgtgccc 4 80 

aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa gagtacccca 540 

gtcgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccac gctatggcca 600 

gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg cgacatgact 660 

ccctgttgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg aacagccaga 720 

tgtgttgcca gggagaacag aacccaccag gtcccttttg gaaggcagca tttgcagact 780 

ccagcgggca gcaccctcct gacgcctgcg cctttcgtcg cggccgcaag aaacgcattc 840 

cgtacagcaa ggggcagttg cgggagctgg agcgggagta tgcggctaac aagttcatca 900 

ccaaggacaa gaggcgcaag atctcggcag ccaccagcct ctcggagcgc cagattacca . 960 

tctggtttca gaaccgccgg gtcaaagaga agaaggttct cgccaaggtg aagaacagcg 1020 

ctacccctta agagatctcc ttgcctgggt gggaggagcg aaagtggggg tgtcctgggg 1080 

agaccagaaa cctgccaagc ccaggctggg gccaaggact ctgctgagag gcccctagag 1140 

acaacaccct tcccaggcca ctggctgctg gactgttcct caggagcggc ctgggtaccc 1200 

agtatgtgca gggagacgga accccatgtg acaggcccac tccaccaggg ttcccaaaga 1260 

acctggccca gtcataatca ttcatcctca cagtggcaat aatcacgata accagt 1316 

* 

<210> 77 

<211> 566 

<212> DNA 

<213> Homo sapiens 



<400> 77 

cccaccaaac ccataaagag ggtgggtcga cccacgcgtc cgcggacgcg tgggaaatta 60 

ttgaattgga aacagaaata gaaaagttta aagctgagaa cgcatcttta gctaaacttc 120 

gcattgaacg agaaagtgcc ttggaaaaac tcaggaaaga aattgcagac ttcgaacaac 180 

agaaagcaaa agaattagct cgaatagaag agtttaaaaa ggaggagatg aggaagctac 240 

aaaaggaacg taaagttttt gaaaagtata ctacagctgc aagaactttt ccagataaaa 300 

aggaacgtga agaaatacag actttaaaac agcaaatagc agatttacgg gaagatttga 360 

aaagaaagga gaccaaatgg tcaagtacac acagccgtct cagaagccag atacaaatgt 420 

tagtcagaga gaacacagac ctccgggaag aaataaaagt gatggaaaga ttccgactgg 480 

atgcctggaa gagagcagaa gccatagaga gcagcctcga ggtggagaag aaggacaagc 540 

ttgcgaacac atctgttcga tttcaa . 566 

<210> 78 

<211> 5067 

<212> DNA 

<213> Homo sapiens 



<400> 78 

gcccggacac ctgtctgcag catggataag tatgacgacc tgggcctgga ggccagtaaa 60 

ttcatcgagg acctgaacat gtatgaggcc tctaaggatg ggctcttccg agtggacaag 120 

ggtgcaggca acaaccccga gtttgaggaa actcgcaggg tgttcgccac caagatggcc 180 

aaaatccacc tccagcagca gcagcagcag ctcctgcagg aggagactct gcccaggggg 240 

agtagaggcc ctgtcaatgg agggggccgc ctgggcccac aggcccgttg ggaagttgtg 300 
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ggcagcaagc tgactgtgga tggtgctgcc aagcctcctc ttgctgcctc gacaggggca 360 

cctggggcag tcaccaccct cgctgctggg cagcccccgt acccaccgca ggagcagaga 420 

tccaggccat acctgcatgg cacgaggcat ggcagccagg actgtggttc cagggagagc 480 

ctggcgactt ctgagatgtc tgctttccac cagccaggcc cctgtgagga tccttcctgc 540 

ctcactcatg gagactatta tgacaacctc tccttggcaa gcccaaagtg gggtgacaaa 600 

ccaggagtgt cccccagcat cggcctgagt gtagggagtg ggtggcctag ctccccgggg 660 

agtgacccac cactgcccaa accctgcggg gaccatcccc taaatcaccg acagctctcc 720 

ctgagctcca gcaggtcttc tgagggtagc ctcggtggtc agaatagtgg cattggtggc 780 

cgcagcagcg agaagccaac aggcctttgg tccactgcct cctcccagcg ggtgagccct 84 0 

ggcctgcctt ccccaaactt ggagaacgga gcaccagctg tggggcctgt tcagcccagg 900 

actccttctg tgtcagcacc cttggccctg agctgcccca ggcaaggagg tcttccaaga 960 

tcaaactcgg ggctgggggg tgaggtttca ggtgtgatgt ccaaacccaa tgtggacccc 1020 

caaccctggt tccaggatgg gcccaaatct tacctttcca gttctgcccc gtcatcctcg 1080 

ccagctggcc tggacggttc acagcagggt gcggtccctg ggctggggcc gaagcctggc 1140 

tgcacagacc ttggcactgg tcccaagctc agccccacca gtcttgtcca tccagtgatg 1200 

tccaccctgc ctgagttatc ttgtaaagag ggtcccctgg gctggtcttc tgatggtagc 1260 

ctgggatctg tgctcctgga cagccccagc tcccctaggg taaggctgcc ctgccagccc 1320 

ctcgtcccag gtcctgagct gagaccctct gctgctgagt tgaaattaga agccctcacc 1380 

caacgtctgg agcgagagat ggatgctcac ccgaaggctg attactttgg agcctgtgtg 14 4 0 

aaatgcagca aaggggtgtt tggggctggc caggcctgtc aggccatggg gaacctctac 1500 

catgacacat gcttcacctg tgcagcttgc agccggaagc tgagaggaaa agccttttat 1560 

tttgtcaacg gcaaagtgtt ttgtgaagaa gacttcctgt actctggttt ccagcagtcg 1620 

gctgacaggt gttttctttg tggacatctg atcatggaca tgatcctgca agccctgggg 1680 

aagtcctacc accccggctg tttccgctgt gtcatctgta atgagtgttt ggatggggtg 1740 

cccttcaccg tggactcaga gaacaagatc tactgtgtcc gagattacca caaggtgctg 1800 

gcccccaagt gtgcagcctg tgggcttccc atccttccac ctgagggctc agatgagacc 1860 

atccgtgtcg tgtccatgga cagagactac cacgtggagt gttaccactg cgaggactgt 1920 

ggtctggagc tcaatgatga agatggccac cgctgttatc cgctggagga ccacctgttc 1980 

tgtcactcct gccacgtgaa gaggctggag aagagaccct catctacagc ccttcaccag 2040 

caccacttct agccagagcc acttgcagac atcacggcag gggatgagga gccggggttg 2100 

ctgctgctgc ttccggtggc ccctggggtg gaagtggggt aggggaagag gaggggcagg 2160 

agggagagtt cctgtgagca tgtggggggt gcctttcctt taaccaggga ggtgaacact 2220 

acctgcctcc tgcgtgtatt ttccaagtgc ttttctctgt tgccacattt tcctcaggtt 2280 

actcaggaaa atgctccagc atgtgcgagc acatgacctg aggttgcatc atagcaccaa 2340 

aggaatcctc ctgtcccctc tgggaacatt tcatgcttca gagggagagg tttttattga 2400 

gcttgtttca caatatcccc ttgaagggac agctcagctg ccaatacatt caaccctttc 24 60 

tcttccttca ggaaaatacc tatacccaaa tgttccctcc cccgacatat atcatggcat 2520 

gacttaaggc ttcttttcac ctgagagctt cagttcttct gcagaatggc tgcaaattta 2580 

attgcattaa ggcaagaagg aagctctaat gtgtgctttg tatcctaaga taaatttgct 2640 

tagaaaacca gagtcaagat ttgaaatagg tgaggcaggg tttcctcctt agacactgac 2700 

agcattctcc gtaccccttc aaatccttac tctcctaaag gcagctgagt ccgcgacaga 2760 

aatttgccct atgggagtaa aacatacttt gggagaagaa cttggtgcag gcaccaggat 2820 

tttttttttt gcccacgtgt ttgcgctgtt tttctctgga gttctcaaga gttggtgact 2880 

tggaaggccg cttctgcaag gcaagtctca ggaacccatg caggtacatc gcttgcacct 2940 

gtttttagct tatttaatga cgggcttttg ggaagagctg cccgcatact gagagacagc 3000 

ttcttataaa caaggagagt ttttgtgtgt gcgagatctc taagccagcg tgggagggag 3060 

cgcctcagga taagttatta tattcatttc gttggtttct ctcctgccca attcttggca 3120 

caggcattat gtttgaagaa accaggataa ggtacactgc ttttgtctgt ttaatttttt 3180 

tagttgtttc ccttcacttt cagtcttcca cacacaaaaa atacctcaca gagcttcacc 324 0 

aaatcacaga ttcaggagga atttggcttt cacactggac tcagatacct tcttcagtgt 3300 

gttggaaatc actggcttca cacaggccca actccaactg gtcagggcag agtgatcgta 3360 

actaaaggtc agtggggaat agatccgatt cagtgctttt gccttatgca tttcagcatc 3420 

ctggctcccc agggtggcag gagctgagga agggccacac actggcaaga tttcaagacc 3480 

actctctgca ctgaagaggt aaaatttgca ctgcaagtca catccctgag gccagaggtc 3540 

agtacccttt ggtatttcga ttagaagaag ctgcaaaaga aaggcagccc attttaccat 3600 

tgccagccag gccggggaca caggagccgg tgtgtgcact ctgcctccta acattgcacc 3660 

cagagcaaga ggactgggtg ctgggctgca gaggccggtc agtggagccc ctagcacgtg 3720 
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tgaactcagg cttttcattg ggcccggctc cacttctagg ccatgttttg actcatttgg 3780 

taaccattgc ctgtaagcag cacagaattg gtgccatgga ttatcttttc catgttgatg 3840 

gaattcattc tgttggaatc ctttggccag atgtcacttc agccagggtg tgcatcatca 3900 

ttggttcttt ttcacaggct gagcctcctg aaaacccatg aacgctgggg ctggggaagt 3960 

gaaccctgag gtggggaccc tctcttccca tcaaatcatc cagctcagtg tggggcgtgg 4020 

caggggggta aatgaagcca gccaatgtgt taacctgtct ctgtcaacct aagaatgttg 4080 

gccttactga cacacctttg ctccatgttc aagaccagaa gtagctggga tttgtttgca 4140 

aattgggtaa ttagtttaaa aatctgtgat tacattttta aatgaaattt tcaaagtggc 4200 

ctagattgag gtgattcaga taggtttgcg aatataccat tttatattgt tgagaaagaa 4260 

caaaaaggga atttccagat gtcctagaaa tcctagcaac agatttctct ggttgtcagt 4320 

ttccctggag aaggcgccag ataggaatct ccaatcagtt gtttttctct tcgcttcagg 4380 

cccttacaca aaagccatga agagatgttc acctacccgg tattttaaat gttctgtaaa 4440 

ttattagcca aatagaactg taatggggtt gtatttatgg gcgcctagaa agaaaacaca 4500 

aggacttggt aggccaggaa gaaaagattt taaaatttag aatgaatagc ccttctgggt 4560 

tttctttttg acaattcttg gacttgaggt aaaacaagga ggattgtggc cggatttcag 4620 

atcccaaagc cagcctccat cttaggcctt tgcctcattg tgccttttag gttttcttac 4 680 

ccaccgtctc ctgttttgtc ttttttttct tttctcctac ccctatcttg ggacattcag 4740 

aaactgcctg ggtggtttga gaagagacaa cccagtttga tctgcaatac aaggatccat 4 800 

tcgtaatctc tctctcactg atgttattcc cccatctgcc gtcttggttc atctcaccac 4860 

agaagggcat ttagtcctac ccagccatcg gctgcggtat gacagcagga tggcacttcc 4 920 

catttctctg tggttagtgc tcgagtgaaa acctctttca gctgagtcct ctgaggttct 4 980 

gctgttgagt cctgggtggc tgatggaatg attgaggagg tctggtcacc ctcaagcgcc 5040 

gtcatcgcct tgtttccatg ggcttct 5067 



<210> 79 

<211> 950 

<212> DNA 

<213> Homo sapiens 

<400> 79 

tcgaccggat ccgaattccc attgtgcact aaagcgtctc cctgctccgc ggcccgggct 60 

ggcgggcggg cgctcggctg gcggctgcag cagcagaggg agacccgcgg caaccccggc 120 

aacccagggc tcggcgtcgc tgccaccatg acgggaagca atatgtcgga cgccttggcc 180 

aacgccgtgt gccagcgctg ccaggcccgc ttctcccccg ccgagcgcat tgtcaacagc 240 

aatggggagc tgtaccatga gcactgcttc gtgtgtgccc agtgcttccg gcccttcccc 300 

gaggggctct tctatgagtt tgaaggccgg aagtactgcg aacacgactt ccaaatgctg 360 

tttgctccgt gctgtggatc ctgcggtgag ttcatcattg gccgcgtcat caaggccatg 420 

aacaacaact ggcacccggg ctgcttccgc tgcgagctgt gtgatgtgga gctggctgac 480 

ctgggctttg tgaagaatgc cggcaggcat ctctgccggc cttgccacaa ccgtgagaag 540 

gccaagggcc tgggcaagta catctgccag cggtgccacc tggtcatcga cgagcagccc 600 

ctcatgttca ggagcgacgc ctaccaccct gaccacttca actgcaccca ctgtgggaag 660 

gagctgacag ccgaggcccg cgagctgaag ggtgagctct actgcctgcc ctgccatgac 720 

aagatgggcg ttcccatctg cggggcctgc cgccggccca tcgagggccg agtggtcaac 780 

gcgctgggca agcagtggca cgtggagcac tttgtctgtg ccaagtgtga gaagccattc 840 

ctggggcacc ggcactatga gaagaagggc ctggcctact gcgagcttta gtgcacaatg 900 

ggcggccgcc tcgagtctag actcgagtag ataattgagc ggaatttctt 950 

<210> 80 

<211> 2346 

<212> DNA 

<213> Homo sapiens 

<400> 80 

ccgccgtcgc ccccgcctcc ccctgcctca gcggctgccc ccgccagcgg gccgcccgct 60 

cccccgggcc ttgcagcggg ccccggcccg gctggagggg ccccgacccc agctctggtg 120 

gcgggcagca gcgccgcggc ccccttccct cacggggact cggccctgaa cgagcaggag 180 
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aaggagttgc agcggcggct gaagcgtctc tacccggccg tggacgaaca agagacgccg 240 

ctgcctcggt cctggagccc gaaggacaag ttcagctaca tcggcctctc tcagaacaac 300 

ctgcgggtgc actacaaagg tcatggcaaa accccaaaag atgccgcgtc agttcgagcc 360 

acgcatccaa taccagcagc ctgtgggatt tattattttg aagtaaaaat tgtcagtaag 420 

ggaagagatg gttacatggg aattggtctt tctgctcaag gtgtgaacat gaatagacta 480 

ccaggttggg ataagcattc atatggttac catggggatg atggacattc gttttgttct 540 

tctggaactg gacaacctta tggaccaact ttcactactg gtgatgtcat tggctgttgt 600 

gttaatctta tcaacaatac ctgcttttac accaagaatg gacatagttt aggtattgct 660 

ttcactgacc taccgccaaa tttgtatcct actgtggggc ttcaaacacc aggagaagtg 720 

gtcgatgcca attttgggca acatcctttc gtgtttgata tagaagacta tatgcgggag 780 

tggagaacca aaatccaggc acagatagat cgatttccta tcggagatcg agaaggagaa 840 

tggcagacca tgatacaaaa aatggtttca tcttatttag tccaccatgg gtactgtgcc 900 

acagcagagg cctttgccag atctacagac cagaccgttc tagaagaatt agcttccatt 960 

aagaatagac aaagaattca gaaattggta ttagcaggaa gaatgggaga agccattgaa 1020 

acaacacaac agttataccc aagtttactt gaaagaaatc ctaatctcct tttcacatta 1080 

aaagtgcgtc agtttataga aatggtgaat ggtacagata gtgaagtacg atgtttggga 1140 

ggccgaagtc caaagtctca agacagttat cctgttagtc ctcgaccttt tagtagtcca 1200 

agtatgagcc ccagccatgg aatgaatatc cacaatttag catcaggcaa aggaagcacc 1260 

gcacattttt caggttttga aagttgtagt aatggtgtaa tatcaaataa agcacatcaa 1320 

tcatattgcc atagtaataa acaccagtca tccaacttga atgtaccaga actaaacagt 1380 

ataaatatgt caagatcaca gcaagttaat aacttcacca gtaatgatgt agacatggaa 14 40 

acagatcact actccaatgg agttggagaa acttcatcca atggtttcct aaatggtagc 1500 

tctaaacatg accacgaaat ggaagattgt gacaccgaaa tggaagttga ttcaagtcag 1560 

ttgagacgcc agttgtgtgg aggaagtcag gccgccatag aaagaatgat ccactttgga 1620 

cgagagctgc aagcaatgag tgaacagcta aggagagact gtggcaagaa cactgcaaac 1680 

aaaaaaatgt tgaaggatgc attcagtcta ctagcatatt cagatccctg gaacagccca 1740 

gttggaaatc agcttgaccc gattcagaga gaacctgtgt gctcagctcf taacagtgca 1800 

atattagaaa cccacaatct gccaaagcaa cctccacttg ccctagcaat gggacaggcc 18 60 

acacaatgtc taggactgat ggctcgatca ggaattggat cctgcgcatt tgccacagtg 1920 

gaagactacc tacattagct atgcatttca agagctcaca cttatattgt ggcatatagt 1980 

caacatggaa gtagaccagc tctgctgatt tgaaatttag attttttaaa ttatgtactg 204 0 

gggacaggtt tttgtcgctt tacattgctt cctagtttac agcatgatgc aaatgatttt 2100 

ctaacttagt gttaggagaa attattttcc atctttaacc tcttagttgt ctaagagtta 2160 

aatattactg aatttcagac gttcaaattg atcatcacaa atcctttaaa acaattacct 2220 

aaaagaaacc aaaaatcctg ccttctttgt gggggagggg ggagagaggg gaaggaaatg 2280 

gaacaagttg tgtttgtgtt agcatgtggg tgatgtaaac ttcaaattgg gagatgttcc 234 0 

gacccc 234 6 

<210> 81 

<211> 2512 

<212> DNA 

<213> Homo sapiens 

<400> 81 

caatgcactg acggatatga gtgggatcct gtgagacagc aatgcaaaga tattgatgaa 60 

tgtgacattg tcccagacgc ttgtaaaggt ggaatgaagt gtgtcaacca ctatggagga 120 

tacctctgcc ttccgaaaac agcccagatt attgtcaata atgaacagcc tcagcaggaa 180 

acacaaccag cagaaggaac ctcaggggca accaccgggg ttgtagctgc cagcagcatg 240 

gcaaccagtg gagtgttgcc cgggggtggt tttgtggcca gtgctgctgc agtcgcaggc 300 

cctgaaatgc agactggccg aaataacttt gtcatccggc ggaacccagc tgaccctcag 360 

cgcattccct ccaacccttc ccaccgtatc cagtgtgcag caggctacga gcaaagtgaa 420 

cacaacgtgt gccaagacat agacgagtgc actgcaggga cgcacaactg tagagcagac 480 

caagtgtgca tcaatttacg gggatccttt gcatgtcagt gccctcctgg atatcagaag 540 

cgaggggagc agtgcgtaga catagatgaa tgtaccatcc ctccatattg ccaccaaaga 600 

tgcgtgaata caccaggctc attttattgc cagtgcagtc ctgggtttca attggcagca 660 

aacaactata cctgcgtaga tataaatgaa tgtgatgcca gcaatcaatg tgctcagcag 720 

tgctacaaca ttcttggttc attcatctgt cagtgcaatc aaggatatga gctaagcagt 780 
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gacaggctca actgtgaaga cattgatgaa tgcagaacct caagctacct gtgtcaatat 84 0 

caatgtgtca atgaacctgg gaaattctca tgtatgtgcc cccagggata ccaagtggtg 900 

agaagtagaa catgtcaaga tataaatgag tgtgagacca caaatgaatg ccgggaggat 960 

gaaatgtgtt ggaattatca tggcggcttc cgttgttatc cacgaaatcc ttgtcaagat 1020 

ccctacattc taacaccaga gaaccgatgt gtttgcccag tctcaaatgc catgtgccga 1080 

gaactgcccc agtcaatagt ctacaaatac atgagcatcc gatctgatag gtctgtgcca 1140 

tcagacatct tccagataca ggccacaact atttatgcca acaccatcaa tacttttcgg 1200 

attaaatctg gaaatgaaaa tggagagttc tacctacgac aaacaagtcc tgtaagtgca 1260 

atgcttgtgc tcgtgaagtc attatcagga ccaagagaac atatcgtgga cctggagatg 1320 

ctgacagtca gcagtatagg gaccttccgc acaagctctg tgttaagatt gacaataata 1380 

gtggggccat tttcatttta gtcttttcta agagtcaacc acaggcattt aagtcagcca 1440 

aagaatattg ttaccttaaa gcactatttt atttatagat atatctagtg catctacatc 1500 

tctatactgt acactcaccc ataacaaaca attacaccat ggtataaagt gggcatttaa 1560 

tatgtaaaga ttcaaagttt gtctttatta ctatatgtaa attagacatt aatccactaa 1620 

actggtcttc ttcaagagag ctaagtatac actatctggt gaaacttgga ttctttccta 1680 

taaaagtggg accaagcaat gatgatcttc tgtggtgctt aaggaaactt actagagctc 174 0 

cactaacagt ctcataagga ggcagccatc ataaccattg aatagcatgc aagggtaaga 1800 

atgagttttt aactgctttg taagaaaatg gaaaaggtca ataaagatat atttctttag 1860 

aaaatgggga tctgccatat ttgtgttggt ttttattttc atatccagcc taaaggtggt 1920 

tgtttattat atagtaataa atcattgctg tacaacatgc tggtttctgt agggtatttt 1980 

taattttgtc agaaatttta gattgtgaat attttgtaaa aaacagtaag caaaattttc 2040 

cagaattccc aaaatgaacc agataccccc tagaaaatta tactattgag aaatctatgg 2100 

ggaggatatg agaaaataaa ttccttctaa accacattgg aactgacctg aagaagcaaa 2160 

ctcggaaaat ataataacat ccctgaattc aggcattcac aagatgcaga acaaaatgga 2220 

taaaaggtat ttcactggag aagttttaat ttctaagtaa aatttaaatc ctaacacttc 2280 

actaatttat aactaaaatt tctcatcttc gtacttgatg ctcacagagg aagaaaatga 2340 

tgatggtttt tattcctggc atccagagtg acagtgaact taagcaaatt accctcctac 2400 

ccaattctat ggaatatttt atacgtctcc ttgtttaaaa tctgactgct ttactttgat 2460 

gtatcatatt tttaaataaa aataaatatt cctttagaag atcactctaa aa 2512 

<210> 82 

<211> 2306 

<212> DNA 

<213> Homo sapiens 

<400> 82 

gggcgggagc tgcacgcgcc gtggctccgg atctcttcgt ctttgcagcg tacgcccgag 60 

tcggtcagcg ccggaggacc tcagcagcca tgtcgaagcc ccatagtgaa gccgggactg 120 

ccttcattca gacccagcag ctgcacgcag ccatggctga cacattcctg gagcacatgt 180 

gccgcctgga cattgattca ccacccatca cagcccggaa cactggcatc atctgtacca 240 

ttggcccagc ttcccgatca gtggagacgt tgaaggagat gattaagtct ggaatgaatg 300 

tggctcgtct gaacttctct catggaactc atgagtacca tgcggagacc atcaagaatg 360 

tgcgcacagc cacggaaagc tttgcttctg acccctacct ctaccggccc gttgctgtgg 420 

ctctagacac taaaggacct gagatccgaa ctgggctcat caagggcagc ggcactgcag 480 

agctggagct gaagaaggga gccactctca aaatcacgct ggataacgcc tacatggaaa 540 

agtgtgacga gaacatcctg tggctggact acaagaacat ctgcaaggtg gtggaagtgg 600 

gcagcaagat ctacgtggat gatgggctta tttctctcca ggtgaagcag aaaggtgccg 660 

acttcctggt gacggaggtg gaaaatggtg gctccttggg cagcaagaag ggtgtgaacc 720 

ttcctggggc tgctgtggac ttgcctgctg tgtcggagaa ggacatccag gatctgaagt 780 

ttggggtcga gcaggatgtt gatatggtgt ttgcgtcatt catccgcaag gcatctgatg 840 

tccatgaagt taggaaggtc ctgggagaga agggaaagaa catcaagatt atcagcaaaa 900 

tcgagaatca tgagggggtt cggaggtttg atgaaatcct ggaggccagt gatgggatca 960 

tggtggctcg tggtgatcta ggcattgaga ttcctgcaga gaaggtcttc cttgctcaga 1020 

agatgatgat tggacggtgc aaccgagctg ggaagcctgt catctgtgct actcagatgc 1080 

tggagagcat gatcaagaag ccccgcccca ctcgggctga aggcagtgat gtggccaatg 1140 

cagtcctgga tggagccgac tgcatcatgc tgtctggaga aacagccaaa ggggactatc 1200 

ctctggaggc tgtgcgcatg cagcacctga ttgcccgtga ggcagaggct gccatctacc 1260 
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acttgcaatt atttgaggaa ctccgccgcc tggcgcccat taccagcgac cccacagaag 1320 

ccaccgccgt gggtgccgtg gaggcctcct tcaagtgctg cagtggggcc ataatcgtcc 1380 

tcaccaagtc tggcaggtct gctcaccagg tggccagata ccgcccacgt gcccccatca 144 0 

ttgctgtgac ccggaatccc cagacagctc gtcaggccca cctgtaccgt ggcatcttcc 1500 

ctgtgctgtg caaggaccca gtccaggagg cctgggctga ggacgtggac ctccgggtga 1560 

actttgccat gaatgttggc aaggcccgag gcttcttcaa gaagggagat gtggtcattg 1620 

tgctgaccgg atggcgccct ggctccggct tcaccaacac catgcgtgtt gttcctgtgc 1680 

cgtgatggac cccagagccc ctcctccagc ccctgtccca cccccttccc ccagcccatc 1740 

cattaggcca gcaacgcttg tagaactcac tctgggctgt aacgtggcac tggtaggttg 1800 

ggacaccagg gaagaagatc aacgcctcac tgaaacatgg ctgtgtttgc agcctgctct 1860 

agtgggacag cccagagcct ggctgcccat catgtggccc cacccaatca agggaagaag 1920 

gaggaatgct ggactggagg cccctggagc cagatggcaa gagggtgaca gcttcctttc 1980 

ctgtgtgtac tctgtccagt tcctttagaa aaaatggatg cccagaggac tcccaaccct 2040 

ggcttggggt caagaaacag ccagcaagag ttaggggcct tagggcactg ggctgttgtt 2100 

ccattgaagc cgactctggc cctggccctt acttgcttct ctagctctct aggcctctcc 2160 

agtttgcacc tgtccccacc ctccactcag ctgtcctgca gcaaacactc caccctccac 2220 

cttccatttt cccccactac tgcagcacct ccaggcctgt tgctatagag cctacctgta 2280 

tgtcaataaa caacagctga agcacc 2306 



<210> 83 

<211> 2656 

<212> DNA 

<213> Homo sapiens 



<400> 83 

gaattcgcgg 

gctggtggtt 

cagggagggc 

ttccctgacc 

aggaacaaaa 

caaagaagat 

gaaagcctat 

agccaaggag 

gaggaagcag 

gcatagctac 

tggcgagttc 

atacttcatg 

gaacatcatg 

gcagcattgc 

gatgcgttcc 

ggtctgcagg 

aggagtcgag 

gttcagggtg 

ctttatttct 

gacgatcaac 

gaaaggcatc 

acgagttcca 

cgagacagat 

catggatagc 

cctttccaaa 

tggaaactag 

gattccagaa 

gcaaaatcat 

aggatgcagc 

gtcagcagag 

ggatatctta 

ttacgaagtg 



ccgcagagtc 
gtggggaccc 
gtggtggggg 
tcttgcggcc 
cataaccctt 
tattatcaga 
tatcagcttg 
aagttctccc 
tacgatgcct 
tggaagggag 
tcatcctctt 
gagttgacat 
gacacgtgtg 
cactactgtg 
acgtgtagga 
ggagcaggac 
gatggccaga 
cagaaaagcc 
atagctcagg 
gtgacgatcc 
ccccggatta 
aagaggctaa 
gtggagggga 
tccgcaggaa 
cttaagaaaa 
gccgggaagc 
cagcagcact 
gggacaacac 
cccagaggct 
gagaggttag 
gttaaaggcc 
attaatttcc 



cccgggccaa 
cgcggctgcc 
catggctgag 
cccgagcgct 
tcatttgtac 
tattaggagt 
ccaagaagta 
agctggcaga 
acggctctgc 
gccccactgt 
catttggaga 
tcaatcaagc 
agcgctgcaa 
gcggctccgg 
gatgtggtgg 
aagccaagca 
ccgtgaggat 
ctgtgttccg 
ctcttcttgg 
cccctgggac 
acagctacgg 
cgagccggca 
cggtgaacgg 
gcaaggctag 
tgtttacctc 
agcagcccct 
gagctcccac 
ctctctccac 
ggtggcagtt 
atcttgcagg 
atgcttacag 
ttctcagcaa 



gatggctgcg 
ggctatatcg 
ccgcaagctg 
gctgacattg 
tgcctccttc 
gcctcgaaat 
tcaccctgac 
agcctatgag 
aggcttcgat 
ggaccccgag 
tttccagacc 
tgcaaagggg 
cggcaagggg 
catggaaacc 
ccgcggctcc 
gaaaaagcga 
gcctgtggga 
gagggacggc 
gggaacagcc 
tcagacagac 
ctacggagac 
gcagagcctg 
cgtcaccctc 
gcgtgaggct 
atgatatccc 
ccaagggcca 
ccgcagagcc 
ggaaaggtca 
tcctgtccat 
ctaaaactct 
cttagaaatg 
acctccggga 



cggtgctcca 
ggtagagggg 
agcgtccccg 
agacctggtg 
cacacgagtg 
gccagccaga 
acaaataagg 
gttttgagtg 
cctggggcca 
gagctgttca 
gtgtttgatc 
gtcaacaagg 
aacgagcccg 
atcaacacag 
atcatcatat 
gtgatgatcc 
aaaagggaaa 
gcagacatcc 
agagcccagg 
cagaagattc 
cactacatcc 
atcctgagct 
accagctctg 
ggggaggacg 
agccgaggaa 
gggcacctgg 
tctggacggc 
cagtggacag 
tggtaggtga 
aatttggaat 
aagccttaag 
ggttccagaa 



cacgctggtt 
cccggccgcc 
cctttgcgtc 
tcagccttac 
cccctttggc 
aagagatcaa 
atgatcccaa 
atgaggtgaa 
gcggctccca 
ggaagatctt 
agcctcagga 
agttcaccgt 
gcaccaaggt 
gcccttttgt 
cgccctgtgt 
ctgtgcctgc 
ttttcattac 
actccgacct 
gcctgtacga 
ggatgggtgg 
acatcaagat 
acgccgagga 
gtggcagcac 
aggagggatt 
aaagatccac 
gagacgggag 
cttggcaaca 
cccgggcagt 
cggccccctg 
tgaatattgt 
ctgcatcaag 
tgagttcttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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ctgacaggtt gtcttcactg ggagcgtggg gcccccaggc cccaccagca ccgtcctccc 1980 

ctaatgaggg gccctgccga ggcatcagct gctctgctca gttagttttt attcccgggg 2040 

taccaagcag ctgcacagtc ggtgcctggg aagcacgtta aaggcccaga gagatcctgg 2100 

gggttctgct ctgaccgtgt gggtggtgat ccttgtcagg atgtacagtc cttgctccca 2160 

ccccatccgg gatggccgcc tgtccctgac tattgagtcc tgttgttgta agccaggcat 2220 

ggagggctcc tgcccttctg ctgagccaca gcccattgca gcactgtgct ggccagactt 2280 

cagctgcctt gggaactgaa gccctgccac tgttgctagt caggggcttg gttctcccac 2340 

ttacactgtt gacatctatt ttctgaagtg tgtttaaatt attcagtgct aatcattgtt 2400 

ttttcctttg taaatgttga ttcagaaaag gaaagcacag gctaagcagt tgaaggttcc 2460 

ccaccattca gtgagagcag aacccccatt ccccagcctc tgctggtagc atgtcgcagt 2520 

ttccatgtgt ttcaggatct tcgggctgtc gttagacagg ttaatgaaga acacttctca 2580 

acagtttcct ttttgttttc ctttataatt cactaaaata aagcatctat tagtgtctga 2640 

aaaaaaaaaa aaaaaa 2656 

<210> 84 
<211> 2217 
<212> DNA 

<213> Homo sapiens 
<400> 84 

gcggacccgg cgccgaggcg gccacccgag acgcggcgcg cacgctccgg cctgcgcagc 60 

ccggcccggc catggcggcc ccccgcccgt ctcccgcgat ctccgtttcg gtctcggctc 120 

cggcttttta cgccccgcag aagaagttcg gccctgtggt ggccccaaag cccaaagtga 180 

atcccttccg gcccggggac agcgagcctc ccccggcacc cggggcccag cgcgcacaga 240 

tgggccgggt gggcgagatt cccccgccgc ccccggaaga ctttcccctg cctccacctc 300 

cccttgctgg ggatggcgac gatgcagagg gtgctctggg aggtgccttc ccgccgcccc 360 

ctcccccgat cgaggaatca tttccccctg cgcctctgga ggaggagatc ttcccttccc 420 

cgccgcctcc tccggaggag gagggagggc ctgaggcccc cataccgccq ccaccacagc 480 

ccagggagaa ggtgagcagt attgatttgg agatcgactc tctgtcctca ctgctggatg 540 

acatgaccaa gaatgatcct ttcaaagccc gggtgtcatc tggatatgtg cccccaccag 600 

tggccactcc attcagttcc aagtccagta ccaagcctgc agccgggggc acagcacccc 660 

tgcctccttg gaagtcccct tccagctccc agcctctgcc ccaggttccg gctccggctc 720 

agagccagac acagttccat gttcagcccc agccccagcc caagcctcag gtccaactcc 780 

atgtccagtc ccagacccag cctgtgtctt tggctaacac ccagccccga gggcccccag 840 

cctcatctcc ggctccagcc cctaagtttt ctccagtgac tcctaagttt actcctgtgg 900 

cttccaagtt cagtcctgga gccccaggtg gatctgggtc acaaccaaat caaaaattgg 960 

ggcaccccga agctctttct gctggcacag gctcccctca acctcccagc ttcacctatg 1020 

cccagcagag ggagaagccc cgagtgcagg agaagcagca ccccgtgccc ccaccggctc 1080 

agaaccaaaa ccaggtgcgc tcccctgggg ccccagggcc cctgactctg aaggaggtgg 1140 

aggagctgga gcagctgacc cagcagctaa tgcaggacat ggagcatcct cagaggcaga 1200 

atgtggctgt caacgaactc tgcggccgat gccatcaacc cctggcccgg gcgcagccag 1260 

ccgtccgcgc tctagggcag ctgttccaca tcgcctgctt cacctgccac cagtgtgcgc 1320 

agcagctcca gggccagcag ttctacagtc tggagggggc gccgtactgc gagggctgtt 1380 

acactgacac cctggagaag tgtaacacct gcggggagcc catcactgac cgcatgctga 1440 

gggccacggg caaggcctat cacccgcact gcttcacctg tgtggtctgc gcccgccccc 1500 

tggagggcac ctccttcatc gtggaccagg ccaaccggcc ccactgtgtc cccgactacc 1560 

acaagcagta cgccccgagg tgctccgtct gctctgagcc catcatgcct gagcctggcc 1620 

gagatgagac tgtgcgagtg gtcgccctgg acaagaactt ccacatgaag tgttacaagt 1680 

gtgaggactg cgggaagccc ctgtcgattg aggcagatga caatggctgc ttccccctgg 1740 

acggtcacgt gctctgtcgg aagtgccaca ctgctagagc ccagacctga gtgaggacag 1800 

gccctcttca gaccgcagtc catgccccat tgtggaccac ccacactgag accacctgcc 1860 

cccacctcag ttattgtttt gatgtctagc ccctcccatt tccaacccct ccctagcatc 1920 

ccaggtgccc tgacccagga cccaacatgg tctagggatg caggatcccc gccctggggt 1980 

ctggtcctcg cccatcctgc agggattgcc caccgtcttc cagacacccc acctgagggg 2040 

ggcaccaggt ttagtgctgc tgctttcact gctgcacccg cgccctcggc cggccccccg 2100 

agcagccttt gtactctgct tgcggagggc tgggagaccc tccaggacat tcccaccctc 2160 

ccccatgctg ccaagttgta gctatagcta caaataaaaa aaaaccttgt tttccag 2217 
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<210> 85 

<211> 8906 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gaggcggcca aggacctggc cgacatcgcg gccttcttcc gatccgggtt tcgaaaaaac 60 

gatgaaatga aagctatgga tgttttacca attttgaagg aaaaagttgc atacctttca 120 

ggtgggagag ataaacgtgg aggtcccatt ttaacgtttc cggcccgcag caatcatgac 180 

agaatacgac aggaggatct caggagactc atttcctatc tagcctgtat tcccagcgag 24 0 

gaggtctgca agcgtggctt cacggtgatc gtggacatgc gtgggtccaa gtgggactcc 300 

atcaagcccc ttctgaagat cctgcaggag tccttcccct gctgcatcca tgtggccctg 360 

atcatcaagc cagacaactt ctggcagaaa cagaggacta attttggcag ttctaaattt 4 20 

gaatttgaga caaatatggt ctctttagaa ggccttacca aagtagttga tccttctcag 4 80 

ctaactcctg agtttgatgg ctgcctggaa tacaaccacg aagaatggat tgaaatcaga 54 0 

gttgcttttg aagactacat tagcaatgcc acccacatgc tgtctcggct ggaggaactt 600 

caggacatcc tagctaagaa ggagctgcct caggatttag agggggctcg gaatatgatc 660 

gaggaacatt ctcagctgaa gaagaaggtg attaaggccc ccatcgagga cctggatttg 720 

gagggacaga agctgcttca gaggatacag agcagtgaaa gctttcccaa aaagaactca 780 

ggctcaggca atgcggacct gcagaacctc ttgcccaagg tgtccaccat gctggaccgg 840 

ctgcactcga cacggcagca tctgcaccag atgtggcatg tgaggaagct gaagctggac 900 

cagtgcttcc agctgaggct gtttgaacag gatgctgaga agatgtttga ctggatcaca 960 

cacaacaaag gcctgtttct aaacagctac acagagattg ggaccagcca ccctcatgcc 1020 

atggagcttc agacgcagca caatcacttt gccatgaact gtatgaacgt gtatgtaaat 1080 

ataaaccgca tcatgtcggt ggccaatcgt ctggtggagt ctggccacta tgcctcgcag 1140 

cagatcaggc agatcgcgag tcagctggag caggagtgga aggcgtttgc ggcagccctg 1200 

gatgagcgga gcaccttgct ggacatgtcc tccattttcc accagaaggc cgaaaagtat 1260 

atgagcaacg tggattcatg gtgtaaagct tgcggtgagg tagaccttcc ctcagagctg 1320 

caggacctag aagatgccat tcatcaccac cagggaatat atgaacatat cactcttgct 1380 

tattctgagg tcagccaaga tgggaagtcg ctccttgaca agctccagcg gcccttgact 1440 

cccggcagct ccgattccct gacagcctct gccaactact ccaaggccgt gcaccatgtc 1500 

ctggatgtca tccacgaggt gctgcaccac cagcggcacg tgagaacaat ctggcaacac 1560 

cgcaaggtcc ggctgcatca gaggctgcag ctgtgtgttt tccagcagga agttcagcag 1620 

gtgctagact ggatcgagaa ccacggagaa gcatttctga gcaaacatac aggtgtgggg 1680 

aaatctcttc atcgggccag agcattgcag aaacgtcatg aagattttga agaagtggca 1740 

cagaacacat acaccaatgc ggataaatta ctggaagcag cagaacagct ggctcagact 1800 

ggggaatgtg accccgaaga gatttatcag gctgcccatc agctggaaga ccggattcaa 1860 

gatttcgttc ggcgtgttga gcagcgaaag atcctactgg acatgtcagt gtcctttcac 1920 

acccatgtga aagagctgtg gacgtggctg gaggagctgc agaaggagct gctggacgac. 1980 

gtgtatgccg agtcggtgga ggccgtgcag gacctcatca agcgctttgg ccagcagcag 2040 

cagaccaccc tgcaggtgac tgtcaacgtg atcaaggaag gggaggacct catccagcag 2100 

ctcagggact ctgccatctc cagtaacaag accccccaca acagctccat caaccacatt 2160 

gagacggtgc tgcagcagct ggacgaggcg cagtcgcaga tggaggagct cttccaggag 2220 

cgcaagatca agctggagct cttcctgcac gtgcgcatct tcgagaggga cgccatcgac 2280 

attatctcag acctcgagtc ttggaatgat gagctttctc agcaaatgaa tgacttcgac 2340 

acagaagatc tcacgattgc agagcagcgc ctccagcacc atgcagacaa agccttgacc 2400 

atgaacaact tgacttttga cgtcatccac caagggcaag atcttctgca gtatgtcaat 24 60 

gaggtccagg cctctggtgt ggagctgctg tgtgatagag atgtagacat ggcaactcgg 2520 

gtccaggacc tgctggagtt tcttcatgaa aaacagcagg aattggattt agccgcagag 2580 

cagcatcgga aacacctgga gcagtgcgtg cagctgcgcc acctgcaggc agaagtgaaa 2640 

caggtgctgg gttggatccg caacggagag tccatgttaa atgccggact tatcacagcc 2700 

agctcgttac aagaggcaga gcagctccag cgagagcacg agcagttcca gcatgccatt 2760 

gagaaaacac atcagagcgc gctgcaggtg cagcagaagg cagaagccat gctacaggcc 2820 

aaccactacg acatggacat gatccgggac tgcgccgaga aggtggcgtc tcactggcaa 2880 

cagctcatgc tcaagatgga agatcgcctc aagctcgtca acgcctctgt cgctttctac 2940 

aaaacctcag agcaggtctg cagcgtcctc gagagcctgg aacaggagta caagagagaa 3000 
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gaagactggt 
cccatgatca 
cggaggaatg 
atggtgacgc 
caacgggaga 
cagcagtacg 
ggcgagttct 
ctcctgaaag 
ctattgatac 
aaaaaatgtg 
tacaggacct 
agtctccagc 
gctgctcatg 
gctgagctca 
tacctgtggg 
ctcatcatct 
gagctggaaa 
gacaagtttc 
ttggaacatg 
atttcttcct 
gagctgctga 
ctcagcgtgc 
gaaaacattg 
aaaaccttaa 
tttagtaaag 
tttacctcag 
tgggtgggga 
aacaagcagg 
aagggagccc 
ggaaggaggg 
atttccatcg 
tgtgagctga 
cgacggggcc 
gtgcggacca 
atcgcccact 
tccgtctcca 
atcggggccc 
accagccccg 
cacaagcaca 
tccgacgaca 
ggcctgagca 
gaggaaggcg 
cacagcctcc 
cccaccagct 
aaaagcaaga 
agcagccctt 
gagatggaag 
gtggagacag 
cttatgaaag 
aacatccatc 
cttgaagatc 
tacatagctt 
accttttttg 
atcaaaccag 
tccaaaaagg 
gtacccaggc 
atcgttgccc 



gtggcggggc 
gcaagcacct 
cagacgtctt 
acatcaaagc 
acagggtatt 
tggtctttga 
acctttccac 
agcacgagga 
agctggctga 
ttactgctgt 
ctttggaaaa 
tagatatcat 
aacttaatga 
ttcaaactga 
aaatgaccag 
tcggaaacat 
aatatgaaca 
agatgtatgt 
cagggtccta 
accttattaa 
cgtgctgtga 
cgaagcgagc 
agtctcaggg 
ttcgaaaggg 
aagtgaaaga 
agttgggtgt 
gaacaccaac 
actggataaa 
tgaaggagcc 
atggagagga 
cctcacggac 
cagtggtgat 
agaccgtgga 
ctgaccgctc 
ccagaagtag 
gcaatgacgc 
agagctcgcc 
tgcggcggct 
agaagagccg 
gtgcggccac 
gcggtactct 
aggagggggc 
tccagccaga 
ccgaaacacc 
tggcactgga 
ccttcaaccc 
aaaggaaatc 
agcgtgacta 
aagatggtgt 
agatttacga 
cagaaaaact 
attgtcaaaa 
aggacttaaa 
tgcagagaat 
ccagcctgga 
ggtgcaacga 
agggtaaact 



ggataagctg 
ggagcagaag 
cctgaaatac 
tcctgaacag 
gcattactgg 
gaggagtgcc 
acacacctcc 
gttccagata 
tggcttttgt 
ggataagagg 
agccctgggg 
tccagccagt 
agagaagcgg 
aaaggcttat 
tggcgtggaa 
gcaagaaatc 
gttgccagag 
cacatattgc 
ttttgacgag 
accagttcag 
ggaaggaaag 
caatgacgcc 
agaactcatc 
tcgagaacgg 
ttccagtggg 
cacagaacat 
ttcagataat 
gcatatccgc 
cattcacatc 
tctggacagc 
gtctcagaac 
ccatgacttc 
agttctggag 
cccagcggca 
catggaaatg 
cagtccaccc 
gggccccaag 
cagcagcggc 
cgaggtccgc 
cccgcaggac 
ctccaaatcc 
cgacgccgtg 
ctcacaggat 
gagtgcagcc 
ggatcgcccc 
ttcggataat 
cagctcttta 
tgtgcgggac 
tcctgatgac 
ctggcacaga 
aggatccctt 
taaaccaaag 
gcagcgtctt 
catgaagtat 
tacatcagaa 
catgatgaac 
gctcttgcag 



ggcccaaact 
gaggcattcc 
ctgcacagga 
caagtgaaaa 
accatgagga 
aagcaggctt 
acgggctcca 
actgcaaagc 
gaaaaagggc 
tacagagatt 
atttcttcag 
atccctggct 
aaatctgccc 
gtaagagacc 
gagattccac 
tacgaatttc 
gatgttggac 
aaaaataagc 
atacagcagc 
cgaataacga 
ggagagatta 
atgcacctca 
ctacaggaat 
catctcttcc 
agaagcaagt 
gttgaaggag 
aaaattgtcc 
gaagtcatcc 
cctaagaccg 
caaggagacg 
acgctggaca 
accgcttgca 
cggccgcatg 
gaaggcctgg 
gagggcatct 
gcatccgtgg 
cggccgggca 
aaggccgacg 
aagagcgccg 
gagacggtcg 
tcctcctcgg 
cccctgccgc 
gacaaggcct 
gagctcgtca 
agctcactcc 
tcccttctct 
aagagaagac 
cttggctatg 
atgaaaggaa 
gacttttttt 
tttgttaaac 
tctgagcaca 
ggccacaggt 
cagctgttac 
ttagagagag 
gtggggcggc 
gacacattct 



ctgagacgga 
tgaaggcttg 
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agagacggct 
tggaatggat 
gtatacagca 
aaaccaaaga 
atgcccatgc 
tctctctgcg 
attccaacaa 
cagaggtgaa 
gcaggaaaga 
tccgggaatg 
ctggcattgt 
ataataacat 
attgttttgt 
ctgattctac 
gacatggatt 
aatatcagct 
aagatggcct 
gcatgctgga 
ccttccaagt 
tttttgaaat 
acctttataa 
acccttgcaa 
ttaaggcttc 
aggagcggac 
ctcccgccac 
gcagcagcca 
gcgataagct 
acagcaacga 
acaagcctga 

tcccctgtgg 
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gagagtgaag 
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atcgagtaaa 
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gttcataatg 
tatggatacg 
aaacaaagaa 
attcctaaag 
tacttgggca 
tcagctgata 
agccaattcc 
ccttttaaaa 
ggaggtgatg 
agggtttgat 
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gtccttagta 
aagcaaattg 
atttgcactg 
cagcatagag 
gatccacctg 
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gcctgatacg 
ctctggtggc 
gctgaccatc 
ctggtgtctg 
ttcactgtgc 
agactcgctc 
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gaagctggcg 
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ctgtggagaa 
catccagcag 
attagtccgc 
ggaactcgtg 
gggagatagt 
gcccattgat 
gcaagaacta 
ctacatggca 
tgtgttcggc 
agagaagtgc 
gttgcacatg 
atacattgat 
agatctgttg 
cctcaagtat 
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5940 
6000 
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6180 
6240 
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6420 



Page 124 



WO 02/092015 



PCT/US02/15982 



032796-132. ST25 



ggacttctgc ctcgctgcag agagaggcgc atcttcctct ttgagcagat cgtcatattc 6480 

agcgaaccac ttgataaaaa gaagggcttc tccatgccgg gattcctgtt taagaacagt 6540 

atcaaggtga gttgcctttg cctggaggaa aatgtggaaa atgatccctg taaatttgct 6600 

ctgacatcga ggacgggtga cgtggtagag accttcattt tgcattcatc tagtccaagt 6660 

gtccggcaaa cttggatcca tgaaatcaac caaattttag aaaaccagcg caatttttta 6720 

aatgccttga catcgccaat cgagtaccag aggaaccaca gcgggggcgg cggcggcggc 6780 

ggcagcgggg cagcggcggg ggtgggggca gcggcggcgg cggggccccc agtggcggca 6840 

gcggccacag tggcggcccc agcagctgcg gcggcgcccc cagcacgagc aggagccggc 6900 

cctcccggat cccccagcct gtccgacacc acccccccgt gctggtctcc tctgcagcct 6960 

cgagccaggc agaggcagac aagatgtcag agtgaaagca gcagcagtag caacatctcc 7020 

accatgttgg tgacacacga ttacacggca gtgaaggagg atgagatcaa cgtctaccaa 7080 

ggagaggtcg ttcaaattct ggccagcaac cagcagaaca tgtttctggt gttccgagcc 7140 

gccactgacc agtgccccgc agctgagggc tggattccag gctttgtcct gggccacacc 7200 

agtgcagtca tcgtggagaa cccggacggg actctcaaga agtcaacatc ttggcacaca 7260 

gcactccgtt taaggaaaaa atctgagaaa aaagataaag acggcaaaag ggaaggcaag 7320 

ttagagaacg gttatcggaa gtcacgggaa ggactcagca acaaggtatc tgtgaagctt 7380 

ctcaatccca actacattta tgacgttccc ccagaattcg tcattccatt gagtgaggtc 7440 

acgtgtgaga caggggagac cgttgttctt agatgtcgag tctgtggccg ccccaaagcc 7500 

tcaattacct ggaagggccc tgaacacaac accttgaaca acgatggtca ctacagcatc 7560 

tcctacagtg acctgggaga ggccacgctg aagattgtgg gcgtgaccac ggaagatgac 7620 

ggcatctaca cgtgcatcgc tgtcaatgac atgggttcag cctcatcatc ggccagcctg 7680 

agggtcctag gtccagggat ggatgggatc atggtgacct ggaaagacaa ctttgactcc 7740 

ttctacagtg aagtggctga gcttggcagg ggcagattct ctgtcgttaa gaaatgtgat 7800 

cagaaaggaa ccaagcgagc agtggccact aagtttgtga acaagaagtt gatgaagcgc 7860 

gaccaggtca cccatgagct tggcatcctg cagagcctcc agcaccccct gcttgtcggc 7920 

ctcctcgaca cctttgagac ccccaccagc tacatcctgg tcttagaaat ggctgaccag 7980 

ggtcgcctcc tggactgcgt ggtgcgatgg ggaagcctca ctgaagggaa gatcagggcg 8040 

cacctggggg aggttctgga agctgtccgg tacctgcaca actgcaggat agcacacctg 8100 

gacctaaagc ctgagaatat cctggtggat gagagtttag ccaagccaac catcaaactg 8160 

gctgactttg gagatgctgt tcagctcaac acgacctact acatccacca gttactgggg 8220 

aaccctgaat tcgcagcccc tgaaatcatc ctcgggaacc ctgtctccct gacctcggat 8280 

acgtggagtg ttggagtgct cacatacgta cttcttagtg gcgtgtcccc cttcctggat 8340 

gacagtgtgg aagagacctg cctgaacatt tgccgcttag actttagctt cccagatgac 8400 

tactttaaag gagtgagcca gaaggccaag gagttcgtgt gcttcctcct gcaggaggac 84 60 

cccgccaagc gtccctcggc tgcgctggcc ctccaggagc agtggctgca ggccggcaac 8520 

ggcagaagca cgggcgtcct cgacacgtcc agactgactt ccttcattga gcggcgcaaa 8580 

caccagaatg atgttcgacc tatccgtagc attaaaaact ttctgcagag caggcttctg . 8640 

cctagagttt gacctatcca gaagttcttt ctcattctct ttcacctgcc aatcagctgt 8700 

taatctgaat tttcaagaga aaacaagcaa acataactga tcagctgccg gtatgttcat 8760 

cgtgtgaaat tgcattccaa gtgagctgtg ctcagcagtg cttggacaca gagctgcaag 8820 

ctgcgctggg gtggaggacc gtcacttaca ctctgccaag gacggaggtc gcattgctgt 8880 

atcacagtat tttttacgga tttctg 8906 



<210> 86 

<211> 1204 

<212> DNA 

<213> Homo sapiens 

<400> 86 

tcggcggcgg tggtatcggc ggcagctgtg agggggttcc gggaagatgg tgctgatcaa 60 

ggaattccgt gtggttttgc catgttctgt tcaggagtat caggttgggc agctttactc 120 

tgttgcagaa gctagtaaga atgagactgg tggtggagaa ggaattgaag tcttaaagaa 180 

tgaaccttat gagaaggatg gagaaaaggg acagtatacg cacaaaattt atcacctaaa 240 

gagcaaagtg cctgcattcg tgaggatgat tgctcccgag ggctccttgg tgtttcatga 300 

gaaagcctgg aatgcgtacc cctactgtag aacaattgta acgaatgaat atatgaaaga 360 

tgatttcttc attaaaatcg aaacatggca caaaccagac ttgggaacat tagaaaatgt 420 
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acatggttta gatccaaaca catggaaaac tgttgaaatt gtccatatag atattgcaga 480 

tagaagtcaa gttgaaccag cagactacaa agctgatgaa gacccagcat tattccagtc 540 

agtcaagacc aagagaggcc ctttgggacc caactggaag aaggagctgg caaacagccc 600 

tgactgtccc cagatgtgtg cctataagct ggtgaccatc aaattcaagt ggtggggact 660 

gcaaagcaaa gtagaaaact tcattcaaaa gcaagaaaaa cggatattta caaacttcca 720 

tcgccagctt ttttgttgga ttgacaagtg gatcgatctc acgatggaag acattaggag 780 

aatggaagac gagactcaga aagaactaga aacaatgcgt aagaggggtt ccgttcgagg 840 

cacgtcggct gctgatgtct agatgagtcc cctgtagggt cagagacaat gtcaaactgt 900 

ttacgtaatc aaggtcaagt gaggggaaca agcgcagcca gtgatgagtg aacaacaatc 960 

tgaccagtat cttgcagtgt tgacgtttcc cagatgtgtg cttgtgatga tacacacaca 1020 

tgcacaggtt ctcaaccacg tgtgtatata tgtatgtgtg catatgtctg tagctgtata 1080 

taaagcgcat gtagagctac agatccagat acacacactt gtgtatatat gtacatacag 1140 

acatactgaa gggattagta caatttctcc aaagtactgt acctatcttc agcaagaatg 1200 

caaa 1204 

<210> 87 
<211> 892 
<212> PRT 

<213> Homo sapiens 



<400> 87 



Met 


Asp 


His 


Tyr 


Asp Ser 


Gin Gin Thr Asn Asp * Tyr 


Met 


Gin 


Pro 


Glu 


1 








5 






10 










15 




Glu 


Asp 


Trp 


Asp 


Arg Asp 


Leu Leu 


Leu Asp 


Pro 


Ala 


Tro 


Glu 


Lys 


Gin 








20 






25 










30 






Gin 


Arg 


Lys 


Thr 


Phe Thr 


Ala Trp Cys Asn 


Ser 


His 


Leu 


Arg 


Lys 


Ala 






35 






40 










4 5 








Gly 


Thr 


Gin 


He 


Glu Asn 


He Glu 


Glu 


Asp 


Phe 


Arg 


Asp 


Gly Leu Lys 




50 








55 








60 










Leu 


Met 


Leu 


Leu 


Leu Glu 


Val He 


Ser 


Gly Glu Arg 


Leu 


Ala 


Lys 


Pro 


65 








70 








75 








80 


Glu 


Arg 


Gly 


Lys 


Met Arg 


Val His 


Lys 


He 


Ser 


Asn 


Val 


Asn 


Lys 


Ala 










85 






90 










95 




Leu 


Asp 


Phe 


He 


Ala Ser 


Lys Gly Val 


Lys 


Leu 


Val 


Ser 


He Gly Ala 








100 






105 










110 






Glu 


Glu 


He 


Val 


Asp Gly 


Asn Val 


Lys 


Met 


Thr 


Leu 


Gly 


Met 


He 


Trp 






115 






120 










125 








Thr 


lie 


He 


Leu 


Arg Phe 


Ala He 


Gin 


Asp 


He 


Ser 


Val 


Glu 


Glu 


Thr 




130 








135 








140 










Ser 


Ala 


Lys 


Glu 


Gly Leu 


Leu Leu 


Trp 


Cys 


Gin 


Arg 


Lys 


Thr 


Ala 


Pro 


145 








150 








155 










160 


Tyr 


Lys 


Asn 


Val 


Asn He 


Gin Asn 


Phe 


His 


He 


Ser 


Trp 


Lys Asp Gly 










165 






170 










175 




Leu 


Gly 


Phe 


Cys 


Ala Leu 


He His 


Arg 


His 


Arg 


Pro 


Glu 


Leu 


He 


Asp 








180 






185 










190 




Tyr 


Gly 


Lys 


Leu 


Arg Lys 


Asp Asp 


Pro 


Leu 


Thr 


Asn 


Leu 


Asn 


Thr 


Ala 






195 






200 










205 








Phe 


Asp 


Val 


Ala 


Glu Lys 


Tyr Leu Asp 


He 


Pro 


Lys 


Met 


Leu Asp Ala 




210 








215 








220 










Glu 


Asp 


He 


Val 


Gly Thr 


Ala Arg 


Pro 


Asp 


Glu 


Lys 


Ala 


He 


Met 


Thr 


225 








230 








235 










240 


Tyr 


Val 


Ser 


Ser 


Phe Tyr 


His Ala 


Phe 


Ser 


Gly Ala 


Gin 


Lys 


Ala 


Glu 










245 






250 










255 




Thr 


Ala 


Ala 


Asn 


Arg lie 


Cys Lys 


Val 


Leu 


Ala 


Val 


Asn 


Gin 


Glu 


Asn 








260 




• 


265 










270 






Glu 


Gin 


Leu 


Met 


Glu Asp 


Tyr Glu 


Lys 


Leu 


Ala 


Ser 


Asp 


Leu 


Leu 


Glu 



275 280 285 



Page 126 



WO 02/092015 



PCT/US02/15982 



032796-132. ST25 



Trn Tie 

up X 1^ 


Ara 


Arn 


290 






Thr Met 

1 111. L JO 


His 


Ala 

nx Q 


305 

«s V 








His 

nx o 


T .\rc 
Juyo 


A<sn Php 
noil t lie 




Thy 

1 UX 






340 


Phe Met 


Pro 


Ser 




355 






T at i 
IicU 


Gin 


370 






rl n Tie 

Ol U lie 


nx y 


niy 


385 

J U -J 






oin jr o 


Al A 

nia 


OCX. 


T.en Arn 
iic u m y 


Gin 

UX11 


juy o 






420 


Leu Leu 


Lys 


Lys 




435 




Asd Arci 


Val 


Glu 

ulU 


450 






Q en T\/r* 


lyr 


nop 


465 

T W J 






nop VjXII 


T**i^ 

irp 


nop 


T All Gill 

uCU UlU 


A V* /"T 

nig 


i ni 






500 


Glu Tyr 


Ala 


Lys 




515 

«J X J 




Met Glu 


A 

nop 


T on 


530 

j j y 






Gin Glv 


T a eu 


Thr 

1 111 


545 
«j ij 






Al a 

n jl a nop 


uy o 


Gin 

UX U 


TiVQ Tie 
u y o ixc 


Val 
vox 


Gin 

U 11 1 






580 


Thr Thr 


lie 


Thr 




595 
j -j 




Gin Leu 


Val 


Pro 


610 






Gin Gin 


His 


Asn 


625 






Val lie 


Glv 

uxy 


Pro 


Ser Tie 

w CI X X C 


Glu 

UlU 


Met 

1 1C I. 






660 


Gin Tyr 


Glu 


Lys 




675 




Glu Gly 


Asp 


His 


690 






His Thr 


Asn 


Tyr 


705 






Leu Thr 


Thr 


He 


Thr Arg 


Asp 


Ala 



Thr 


He Pro 


Trp 




295 




Met 


Gin Gin 


Lys 




310 




Prn 
tl w 


Pro Lys 


Val 


325 
j *j 






T.on 
Lie U 


Gin Thr 


Lys 


Gin 

UlU 


Gly Arg Met 






360 


Gin 


Val Glu 


Lys 




375 




Ton 
JjC U 


Glu Arg Leu 




390 




Tip 
lie 


His Glu Ala 








A 

nop 


Tyr Glu Thr 


n x o 


Glu Ala 


Phe 






440 


Gin 


He Ala 


Ala 




455 




Car 
OC1 


Pro Ser 


Val 




470 




A e n 
noil 


Leu Gly Ala 


a r s 

4 O J 






Gl ii 

UlU 


Lys Leu 

* 


Leu 


A y rr 
ni y 


Ala- Ala 


Pro 






520 


Gin 


Asp Thr 


Phe 




535 




Thr 
i in 


Ala His 


Glu 




550 




A rrr 
nx vj 


Leu Ala 


He 


565 

W \J «h/ 






Thr 
iiii 


Tyr His 


Val 


t J. u 


Gin Glu 


He 






600 


Arg 


Arg Asp 


Gin 




615 




Glu 


Arg Leu Arg 




630 




Trn 
x xp 


He Gin 


Thr 


645 

W T J 






His 

n x o 


Gly Thr 


Leu 


^e r 

OCX 


He Val 


Asn 






680 


Gin 


Leu He 


Gin 




695 




Thr 


Met Glu 


His 




710 




Ala 


Arg Thr 


He 


725 






Lys 


Gly He 


Ser 



Leu Glu Asn Arg 

300 



T an 
JjC U 


ulU 


TV £-* ^-V 

nSp 


rne 






jIj 




Gl n 


ulU 


Lys 


L.ys 










Lieu 


T\ M MB 

nrg 


Leu 


Ser 


1A ^ 
J 4 0 








Val 


O a v 

oer 


Asp 


He 


Gly 


Tyr 


Glu 


Glu 








380 


A e y** 
nop 


Hi c 

nis 


Leu 


Ala 






395 




irp 


i nr 


Asp Gly 




din 

4 1 U 






Ala 
Ml a 


i nr 


Leu 


Ser 


A ?S 








Gl n 
uiu 


oer 


Asp 


Leu 


He 


Ala 


Gin 


Glu 








460 


7V c* r*\ 

nsn 


nia 


Arg Cys 






475 




T ai i 

L»eu 


i nr 


Gin 


Lys 




a Qn 

4 3U 






ulU 


inr 


He 


Asp 


jUj 








true 


nSn 


Asn Trp 


He 


Val 


His 


Thr 








540 


Gl n 

bin 




Lys Ala 






555 




i»eu 


uiy 


lie 


His 




o / u 






A c n 

nsn 


Met 


Ala Gly 


JO J 








Hon 
no 11 


Gl \7 

uiy 


Lys 


Trp 


Ala 


Leu 


Thr 


Glu 








620 


Liy o 


Gin 

Ul 11 


Phe 


Gly 




• 


635 




iiy o 


Mpr 

rcc u 


Glu 


Glu 




OJW 






Gin 


A en 
nop 


Gin 


Leu 


bob 








Tyr 


Lys 


Pro 


Lys 


Glu 


Ala 


Leu 


He 








700 


He 


Arg 


Val 


Gly 






715 




Asn 


Glu 


Val 


Glu 




730 






Gin 


Glu 


Gin 


Met 



Page 127 



Va 1 
Vol 


fro 


mm 

blU 


Asn 


nrg 


TV _ _ 

Asp 


Tyr 


Arg 








320 


Gin 


Leu 


Glu 


He 










o r% l* 

nsn 


Arg 


Pro 


Ala 




JJ (J 






Asn 


Asn 


TV 1 _ 

Ala 


Trp 


JOJ 








Trp 


Leu 


Leu 


Asn 


GlU 


Lys 


Phe 


Arg 








400 


Lys 


Glu 


Ala 


Met 






410 




ii 

ulU 


lie 


T 1 CM 

Lys 


Ala 




4 ^n 






nia 


Aid 


HIS 


p T _ 

Gin 


4 A ^ 

4 4 J 








Lieu 


7V 

Asn 


blU 


T All 

Leu 


Gin 


Lys 


He 


Cys 








480 


Arg 


Arg 


Glu 


Ala 


• < 




4 yo 




bin 


Leu 


Tyr 


T A** 

Leu 




CI ft 

510 






Met 


Glu 


Gly 


Ala 










T 1 ^ 

lie 


ulU 


blU 


lie 


i nr 


Leu 


Pro 


Asp 








560 


Asn 


Glu 


Val 


Ser 






0 /o 




mr 


Asn 


n u*. 

pro 


Tyr 




con 






nop 


Hie 

nlS 


vai 


7\ 'to** Pw 

Arg 


fin s 








m ii 

ulU 


nis 


Ala 

nia 


Arg 


Al a 
nXa 


Gl n 

uin 


Ala 

nia 


nsn 








640 


He 


Gly 


Arg 


He 






DOj 




C ^ y 

Oci 


Hi « 
nib 


IicU 


7N V" 

nrg 




D / U 






T 1 ft 

ne 


ASp 


p i _ 

bin 


Leu 


Do j 








Phe 


Asp 


Asn 


Lys 


Trp 


Glu 


Gin 


Leu 








720 


Asn 


Gin 


He 


Leu 






735 




Asn 


Glu 


Phe 


Arg 



WO 02/092015 



032796-132. ST25 



PCT/US02/15982 









740 


745 






750 

r ^p* 


Ala 


Ser 


Phe 


Asn His 


Phe Asp Arg Asp His 


Ser Gly Thr 


Leu Glv Pro 






755 




760 




765 


Glu 


Glu 


Phe 


Lys Ala 


Cys Leu lie Ser Leu 


Gly Tyr Asp 


lie Glv Asn 

\j -i. y noil 




770 






775 


780 




Asp 


Pro 


Gin 


Gly Glu 


Ala Glu Phe Ala Ara 


lie Met 


Ser 


He Val Asn 


785 








790 


795 




800 


Pro 


Asn 


Arg 


Leu Gly 


Val Val Thr Phe Gin 


Ala Phe 


T 1 a 


Asd Phe Met- 








805 


810 






815 


Ser 


Arg 


Glu 


Thr Ala 


Asp Thr Asp Thr Ala 


Asp Gin 


Val 


Met Ala Ser 








820 


825 






830 


Phe 


Lvs 


lie 


Leu Ala 


Gly Asp Lys Asn Tyr 


lie Thr 


Met 


mo^j vjiu L6U 






835 




840 




845 




Arg 


Arg 


Glu 


Leu Pro 


Pro Asp Gin Ala Glu 


Tyr Cys 


He 


Ala Arg Met 




850 






855 


' 860 




Ala 


Pro 


Tyr 


Thr Gly 


Pro Asp Ser Val Pro 


Gly Ala 


Leu 


Asp Tyr Met 


865 








870 


875 




880 


Ser 


Phe 


Ser 


Thr Ala 


Leu Tyr Gly Glu Ser Asp Leu 












885 


890 









<210> 88 
<211> 197 
<212> PRT 

<213> Homo sapiens 
<400> 88 



Met 


Met 


Phe 


Pro 


Gin 


Ser Arg His Ser Gly Ser 


Ser 


His 


Leu 


Pro 


Gin 


1 








5 


10 








15 




Gin 


Leu 


Lys 


Phe 


Thr 


Thr Ser Asp Ser Cys Asp Arg 


He 


Lys Asp Glu 








20 




25 






30 






Phe 


Gin 


Leu 


Leu 


Gin 


Ala Gin Tyr His Ser Leu Lys Leu 


Glu 


Cys 


Asp 






35 






40 




45 




Lys 


Leu 


Ala 


Ser 


Glu 


Lys Ser Glu Met Gin Arg His 


Tyr 


Val 


Met 


Tyr 




50 








55 


60 








Tyr 


Glu 


Met 


Ser 


Tyr 


Gly Leu Asn He Glu Met 


His 


Lys 


Gin 


Ala 


Glu 


65 










70 75 










80 


He 


Val 


Lys 


Arg 


Leu 
85 


Asn Gly He Cys Ala Gin 

90 


Val 


Leu 


Pro 


Tyr 
95 


Leu 


Ser 


Gin 


Glu 


His 


Gin 


Gin Gin Val Leu Gly Ala 


He 


Glu 


Arg Ala Lys 








100 




105 






110 






Gin 


Val 


Thr 
115 


Ala 


Pro 


Glu Leu Asn Ser He He 
120 


Arg 


Gin 
125 


Gin 


Leu 


Gin 


Ala 


His 
130 


Gin 


Leu 


Ser 


Gin Leu Gin Ala Leu Ala 
135 


Leu 
140 


Pro 


Leu 


Thr 


Pro 


Leu 


Pro 


Val 


Gly 


Leu 


Gin Pro Pro Ser Leu Pro 


Ala 


Val 


Ser 


Ala 


Gly 


145 










150 155 










160 


Thr 


Gly 


Leu 


Leu 


Ser 


Leu Ser Ala Leu Gly Ser Gin Ala 


His 


Leu 


Ser 










165 


170 








175 




Lys 


Glu 


Asp 


Lys 


Asn 


Gly His Asp Gly Asp Thr His Gin 


Glu Asp Asp 








180 




185 






190 






Gly 


Glu 


Lys 
195 


Ser 


Asp 
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<400> 89 



Gly Asp Lys Glu Pro Thr Glu Thr 


He 


Glv Asp Leu 


Ser 


lie 

JL X v 




JufcJU 


1 5 




10 






15 




Asp Gly Leu Gin Leu Glu Ser Glu 


Val 


Val Thr Asn 


Glv 


Glu 

VJ J. u 


Thr 

Alii. 




20 


25 




30 






Cys Ser Glu Ser Ala Ser Gin Asn 


Asd 


Asd Glv Ser 


Arfl 


Cp r* 

•JCi. 




Ren 
rVop 


35 40 






45 




Glu Thr Arq Val Ser Thr Asn Glv 


Ser 


Asd Asd Pro 


Glu 

VJ X u 


no ^/ 


A3 A 


biy 


50 55 




60 








Ala Glv Glu Asn Ara Ara Val Ser 


Glv 


Asn Asn Ser 


Pro 


Car 

iJCl. 






65 70 




75 










Asn Gly Glv Phe Lvs Pro Ser Ara 


Pro 


Pro Ara Pro 


Ser 


Aror 






85 




90 






95 




Pro Pro Thr Pro Arg Arg Pro Ala 


Ser 


Val Asn Glv 


Ser 


Pro 


OCX. 


r\A. a 


100 


105 






110 

^ W 






Thr Ser Glu Ser Asp Gly Ser Ser 


Thr 


Glv Ser Leu 


Pro 


Pro 


Thr 


len 

1 


115 120 






125 








Thr Asn Thr Asn Thr Ser Glu Gly 


Ala 

• ^b» 


Thr Ser Glv 


Leu 


lie 


lie 

X J. c 


Pro 


130 135 




140 










Leu Thr lie Ser Gly Gly Ser Gly 


Pro 

^b» 


Ara Pro Leu 


Asn 


Pro 


Val 


Thr 


145 150 




155 








160 

J» v V 


Gin Ala Pro Leu Pro Pro Gly Trp 


Glu 


Gin Arq Val 


Asd 


Gin 


His 


Glv 


165 




170 






175 


Arg Val Tyr Tyr Val Asp His Val 


Glu 


Lvs Ara Thr 


Thr 


TrD 


Asn 


Aro 


180 


185 






190 


Pro Glu Pro Leu Pro Pro Glv TrD 


Glu 


Ara Ara Val 


Asd 




Met* 


oiy 


195 200 






205 






Arq lie Tyr Tyr Val Asp His Phe 


Thr 


Ara Thr Thr 


Thr 


Tro 


Gin 


A TTf 

/ax. y 


210 215 




220 








Pro Thr Leu Glu Ser Val Arq Asn 


Tvr 


Glu Gin Tro 


Gin 

Sb» ^ » a 


Leu 


Gin 


A T*rr 
n A y 


225 230 




235 








240 

4m ^ w 


Ser Gin Leu Gin Gly Ala Met Gin 


Gin 


Phe Asn Gin 


Ara 


Phe 


He 


Tvr 
x y x 


245 




250 






255 


Gly Asn Gin Asp Leu Phe Ala Thr 


Ser 


Gin Ser Lys 


Glu 


Phe 


Asd 


Pro 


260 


265 






270 




Leu Gly Pro Leu Pro Pro Gly Trp 


Glu 


Lys Arg Thr 


Asp 


Ser 


Asn 


Glv 


275 280 






285 






Arg Val Tyr Phe Val Asn His Asn 


Thr 


Arg He Thr 


Gin 


TrD 


Glu 


Asd 


290 295 




300 








Pro Arg Ser Gin Gly Gin Leu Asn 


Glu 


Lys Pro Leu 


Pro 


Glu 


Glv 


Tro 


305 310 




315 






320 


Glu Met Arg Phe Thr Val Asp Gly 


He 


Pro Tyr Phe 


Val 


Asp 


His 


Asn 

4 4> Imb* 4 4 


325 




330 






335 




Arg Arg Thr Thr Thr Tyr lie Asp 


Pro 


Arq Thr Gly 


Lvs 


Ser 

%B» *• 


Ala 

4 i«4W 


Leu 

4V W %b* 


340 


345 






350 






Asp Asn Gly Pro Gin lie Ala Tyr 


Val 


Arq Asd Phe 

K V BW B M B^B^ ^B B B 


Lvs 


Ala 


Lvs 


Val 

* W4 


355 360 






365 








Gin Tyr Phe Arg Phe Trp Cys Gin 


Gin 


Leu Ala Met 


Pro 


Gin 


His 


He 


370 375 




380 










Lys lie Thr Val Thr Arg Lys Thr 


Leu 


Phe Glu Asp 


Ser 


Phe 


Gin 


Gin 


385 390 




395 








400 


He Met Ser Phe Ser Pro Gin Asp 


Leu 


Arg Arg Arg 


Leu 


Trp 


Val 


He 


405 




410 






415 




Phe Pro Gly Glu Glu Gly Leu Asp 


Tyr 


Gly Gly Val 


Ala 


Arg 


Glu 


Trp 


420 


425 






430 






Phe Phe Leu Leu Ser His Glu Val 


Leu 


Asn Pro Met 


Tyr 


Cys 


Leu 


Phe 
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435 








440 


445 


Glu 

mm 


Tvr 


Ala 


Gly 


Lys 


Asp Asn Tyr Cys Leu 


Gin He Asn Pro Ala Ser 




450 










455 


4 60 


Tvr 


He 


Asn 


Pro 


Asp 


His 

mm ^m mf 


Leu Lys Tyr Phe Arg. Phe He Gly Arg Phe 


465 










470 




475 480 


lie 

mm ^ 


Ala 

• m^m mm 


Met 


Ala 


Leu 


Phe 

m, m m W 


His Gly Lys Phe 


He Asp Thr Gly Phe Ser 










485 




490 


495 


Leu 


Pro 

m mm m* 


Phe 

m, mm 


Tvr 


Lvs 


Arg 


He Leu Asn Lys 


Pro Val Gly Leu Lys Asp 








500 






505 


510 


Leu 


Glu 


Ser 

mS m* ±m 


He 

Jto J» w 


Asd 


Pro 


Glu Phe Tyr Asn 


Ser Leu He Trp Val Lys 






515 

*m mm 








520 


525 


Glu 


Asn 


Asn 

* mmf m m 


He 


Glu 


Glu 


Cys Asp Leu Glu 


Met Tyr Phe Ser Val Asp 




530 










535 


540 


Lys 


Glu 


He 

mm *m» \* 


Leu 


Glv 


Glu 


He Lys Ser His 


Asp Leu Lys Pro Asn Gly 


545 










550 




555 560 


Gly Asn 


He 

mm 


Leu 


Val 

▼ mm> 


Thr Glu Glu Asn Lys Glu Glu Tyr He Arg Met 










565 




570 


575 


Val 


Ala 


Glu 


TrD 


Ara 


Leu 


Ser Arg Gly Val Glu Glu Gin Thr Gin Ala 








580 






585 


590 


Phe 


Phe 


Glu 


Glv 


Phe 

A m m* 


Asn 


Glu He Leu Pro 


Gin Gin Tyr Leu Gin Tyr 






595 








600 


605 


Phe 


Asp 


Ala 


Lvs 

■mJ y kJ 


Glu 

V-J 


Leu 


Glu Val Leu Leu 


Cys Gly Met Gin Glu He 




610 










615 


620 


Asp Leu 


Asn 


Asd 


Tro 


Gin Arg His Ala He 


Tyr Arg His Tyr Ala Arg 


625 










630 




635 640 


Thr 


Ser 


Lvs 


Gin 

\J X 11 


He 


Met 


Trp Phe Trp Gin 


Phe Val Lys Glu He Asp 










645 




650 


655 


Asn 


Glu 


Lvs 


Ara 


Met 


Arg 


Leu Leu Gin Phe 


Val Thr Gly Thr Cys Arg 








660 






665 


670 


Leu 


Pro 


Val 


Glv 


Glv 


Phe Ala Asp Leu Met 


Gly Ser Asn Gly Pro Gin 






675 








680 


685 


Lys 


Phe 


Cys 


He 


Glu 


Lys 


Val Gly Lys Glu 


Asn Trp Leu Pro Arg Ser 




690 










695 


700 


His 


Thr 


Cys 


Phe 


Asn 


Arg Leu Asp Leu Pro 


Pro Tyr Lys Ser Tyr Glu 


705 










710 




715 720 


Gin 


Leu 


Lys 


Glu 


Lys 


Leu 


Leu Phe Ala He 


Glu Glu Thr Glu Gly Phe 










725 




730 


735 


Gly Gin 


Glu 













<210> 90 
<211> 431 
<212> PRT 

<213> Homo sapiens 



<400> 90 



Gly 


Pro 


Pro 


Pro 


Thr 


Arg Ala 


Leu Pro Leu Pro 


Gin 


Ser Leu Pro Pro 


1 








5 




10 




15 


Asp 


Phe 


Arg 


Leu 


Glu 


Pro Thr 


Ala Pro Ala Leu 


Ser 


Pro Arg Ser Ser 








20 






25 




30 


Phe 


Ala 


Ser 


Ser 


Ser 


Ala Ser 


Asp Ala Ser Lys 


Pro 


Ser Ser Pro Arg 






35 








40 




45 


Gly Ser 


Leu 


Leu 


Leu 


Asp Gly Ala Gly Ala Gly Gly Ala Gly Gly Ser 




50 








55 




60 




Arg 


Pro 


Cys 


Ser 


Asn 


Arg Thr 


Ser Gly He Ser 


Met 


Gly Tyr Asp Gin 


65 










70 


75 




80 


Arg 


His 


Gly 


Ser 


Pro 


Leu Pro 


Ala Gly Pro Cys 


Leu 


Phe Gly Pro Pro 
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85 






Leu Ala 


Gly 


Ala 


Pro Ala 


Gly 


Tyr 






100 








Tyr Pro 


Glu 


Leu 


His Ala 


Ala 


Leu 


• 


115 








120 


Ala Gly 


Phe 


Gly 


Cys Gin 


Glu 


Ser 


130 








135 




Gly Ser 


Pro 


Glv 

w * Jr 


Ala Leu 


Ala Gly 


145 






150 






Leu Glu 


Arg 


Ara 


Glv Ala 


Gin 


Pro 








165 






Gly Asp Cys 


Ala 


Val Glv 


Ala 


Arg 






180 








Leu Ara 


Leu 


Thr 


Val Glv 


Thr 


Gly 




195 








200 


Glu Pro 


Ser 


Glv 


He Glu 


Pro 


Ser 


210 








215 


• 


Phe Val 


Pro 


Glu 


Ala Ala 


Arg 


Ala 


225 






230 






Glu Asp 


Tyr 


Phe 


Glv Thr 


Cys 


He 








245 






Gin Ser 


Asn 


Ala 


Cys Gin 


Ala 


Leu 






260 








Phe Val 


Cys 


Cys 


Ser Cys 


Gly Arg 




275 








280 


Ser Val 


Asn 


Gly 


Ser Val 


Tyr Cys 


290 








295 




Phe Gin 


Glu 


Ala 


Ala Glu 


Lys 


Cys 


305 






310 






Glu Lys 


He 


Leu 


Gin Ala 


Met Gly 








325 






Arg Cys 


He 


Val 


Cvs Asn 


Lys 


Cys 






340 








Asp Phe 


Ser 


Asn 


Gin Val 


Tyr Cys 




355 








360 


Ala Pro 


Lys 




Ala A 1 a 
nld nla 


Cys 


Gly 


370 








375 




Cys Glu Asp 


He 


Val Arg 


Val 


He 


385 






390 






Glu Cys 


Tyr 


His 


Cys Glu 


Asp 


Cys 








405 






Gly Cys Cys 


Cys 


Phe Pro 


Leu Asp 






420 
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<213> Homo sapiens 

<400> 91 

Gly Pro Gly Ser Arg His His Arg 

1 5 
Gly Ala Val Ser Thr Asp Val Leu 

20 

Thr Gin Ser Pro Lys Met Asn He 

35 40 
Leu Gly Ala Ser Phe Thr Glu Asp 
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90 








95 


Ser 


Pro 


Glv Glv 


Val 

* a X 




Cor Al a 

OCX M.X d 


105 








110 

X X w 




Asd 


Arcr 


Leu Tvr 


Ala 


Gl n 

OX 11 


Aror Pro 
ni y it x \j 








XX J 






Arcr 


His 


Spr Tvr 




Prn 
t x U 


A 1 s T ati 
nld lieu 






140 








Ala 

nx cl 




Val Glv 
vox ox y 


Al a 


Ala 
nla 


oxy fro 






X wS «J 






160 


Glv 
oiy 


A TTT 

y 


nio ocx 


Vcl X 


i nr 


oxy xyr 




170 








X / J 


Tvr 


Gin 


A«?n Gl ii 


iiCU 


Thr 
A fix 


nla LcU 


185 








190 

X ^ V 




Glv 


Arcr 


Glu Ala 


Glv 
oxy 


Ala 
nla 


nxy oXy 








205 

C V J 






Glv 

vj X y 


IiCU 


Glu Gl ii 
oxu ox u 


JTx O 


trx O 


oxy fro 






220 






A r rr 


Met 


Arn Gl n 


Pro 

t x O 


Gl ii 
oXU 


Aid M.rg 






235 






Z 1 VJ 


T.we 
Xiy o 


r*vq 


Ben T.\/Q 


Gl \/ 
ox y 


x xe 


T t » v r* lit 

iyr oxy 




250 

w w V 








X J J 


A*3n 


Ser 


Tj^li Tvr 

ucu x y x 


His 


Thr 

1 11X 


oxn ^ys 


265 








270 




Thr 


Leu 


Arn C*v«5 


x»y o 


Al a 


pKo T\/r 
riic i yx 








285 






Glu 


Glu 


Asn Tvr 

ngu x y x 




Php 


oci oxy 






300 

\^ \^ 










Val 

V Cl X 


pwe Glv 
ox y 


His 
nig 


T on 
XrC U 


T 1 a Tan 
lie Xicll 






11 5 






Ox v 


Jj jf 3 


Car 


Tvr Hie 
i yi n x o 




Glv 
oxy 


v»«y5 trfie 




330 








j j j 




A 


Gl v Tie 
ox y lie 


Pro 
r x V 


tile 


Thr- Ua 1 
lUX Vdl 


345 












v at x 


1 11X 


Acn T \/ y 


His 


x»ys 


Ben T«iy" 

nsn l yjL 








365 






Gin 


Pro 


lie Leu 


Pro 


Ser 


Glu Gly 






380 








Ser 


Met 


Asp Arg 


Asp 


Tyr 


His Phe 






395 






400 


Arg 


Met 


Gin Leu 


Ser 


Asp 


Glu Glu 




410 








415 


Gly 


His 


Leu Leu 


Cys 


His 


Gly 


425 








430 




Ala 


Arg 


Asp Arg 


Leu 


He 


His Phe 




10 








15 


Gly 


Cys 


Ser Ala 


His 


Cys 


Ser Leu 


25 








30 




Gin 


Glu 


Gin Gly 


Phe 


Pro 


Leu Asp 








45 






Ala 


Pro 


Arg Pro 


Pro 


Val 


Pro Gly 
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50 


c c 

55 






00 








GlU 


GlU 


Giy giu Leu Val Ser Tnr 


Asp 


Pro Arg 


Pro 


TV 1 - 

Ala 


Ser Tyr 


Ser 


c e; 
Dj 




/u 




/5 








80 


Pne 


cys 


ber Gly Lys Gly Val Gly 


He 


Lys Gly 


Glu 


Thr 


Ser Thr 


Ala 










Oft 






95 




Tnr 


n w* >"*v 


Arg Arg ber Asp Leu Asp 


Leu 


Gly Tyr 


Glu 


Pro 


Glu Gly Ser 






i nn 


i n«; 

1U j 








110 




Ala 


m? #^ w* 

ser 


fro Tnr Pro Pro Tyr Leu 


Lys 


Trp Ala 


GlU 


Ser 


Leu His 


Ser 






i 1 c i oft 
11D 120 








125 






Leu 


Leu 


Asp Asp Gin Asp Gly He 


Ser 


Leu Pne 


Arg 


Thr 


Phe Leu 


Lys 






IOC 
1.55 






140 








Gin 


GlU 


Gly Cys Ala Asp Leu Leu 


Asp 


Pne Trp 


Phe 


Ala 


Cys Thr 


Gly 


14 5 




150 




ICC 

155 








160 


Pne 


Arg 


Lys Leu Glu Pro Cys Asp 


Ser 


^\ dM 1 _ _ 

Asn Glu 


Glu 


Lys 


Arg Leu 


Lys 






lo5 




170 






175 




Leu 


Ala 


Arg Ala lie Tyr Arg Lys 


Tyr 


lie Leu 


Asp Asn Asn Gly 


He 






i on 
lou 


1 AC 

loD 








190 




Val 


Ser 


Arg Gin Tnr Lys Pro Ala 


m u. „ 

Tnr 


Lys Ser 


Phe 


lie 


Lys Gly Cys 






13J 2UU 








205 






He 


Met 


Lys Gin Leu He Asp Pro 


TV T _ 

Ala 


Met Pne 


Asp Gin Ala Gin 


Thr 




210 


^13 






220 








Glu 


He 


Gin Ala Tnr Met Glu Glu 


Asn 


^0 M MM B • MM 

Tnr Tyr 


Pro 


Ser 


Phe Leu 


Lys 


225 
















o a n 


Ser Asp 


He Tyr Leu Glu Tyr Thr 


Arg 


Tnr Gly 


Ser 


Glu 


Ser Pro 


Lys 






24 5 




OCft 

250 






255 




Val 


Cys 


Ser Asp Gin Ser Ser Gly 


Ser 


Gly Tnr 


Gly Lys 


Gly He 


Ser 








o ft ^ 








270 




Gly Tyr 


Leu Pro Tnr Leu Asn Glu 


Asp 


Glu Glu 


Trp Lys 


Cys Asp 


Gin 






<:75 280 








285 






Asp Met 


Asp Glu Asp Asp Gly Arg 


Asp 


T\ 1 _ TV 1 » 

Ala Ala 


Pro 


Pro 


Gly Arg 


Leu 




290 


1 ft c 

295 






300 








Pro 


Gin 


Lys Leu Leu Leu Glu Tnr 


TV 1 _ 

Ala 


Ala Pro 


Arg Val 


Ser Ser 


Ser 


305 




J1U 




OLD 








320 


Arg Arg 


Tyr Ser Glu Gly Arg Glu 


Pne 


Arg Tyr 


Gly Ser 


Trp Arg 


Glu 






IOC 

025 




^ -j rv 






335 




Pro 


Val 


TV ^* m^ r i m m% Hft « _ «m> • • T T 1 TV a 

Asn fro ryr Tyr val Asn 


Ala 


Gly Tyr 


Ala 


Leu 


Ala Pro 


Ala 






n 


j4 j 








350 




Thr 


Ser 


Aia Asn- Asp ber Glu Gin 


Gin 


ser Leu 


Ser 


Ser 


Asp Ala 


Asp 














365 






Thr 


Leu 


ber Leu Tnr Asp Ser Ser 


val 


Asp Gly 


He 


Pro 


Pro Tyr 


Arg 




370 


J /o 






380 








He Arg 


T i * a i m% ft _ a TV m TV m* T * » 

Lys Gin His Arg Arg Glu 


Met 


Gin GlU 


Ser 


Ala 


Gin Val 


Asn 


385 
















400 


Gly Arg 


val Pro Leu Pro His lie 


Pro 


Arg Tnr 


Tyr Arg Val Pro 


Lys 






4 U5 










415 




Glu 


Val 


Arg val Glu Pro Gin Lys 


pne 


Air* r 1 it 

Ala GlU 


Glu 


Leu 


He His 


Arg 






4 4 on 


/IOC 








430 




Leu 


Glu 


Ala val Gin Arg Tnr Arg 


GlU 


TV 1 — pi n 

nia GlU 


Glu 


Lys 


Leu Glu 


Glu 






4 o5 4 U 








445 






Arg 


Leu 


Lys Arg Val Arg Met Glu 


Glu 


Glu Gly 


Glu 


Asp Gly Asp Pro 




450 


455 






460 








Ser 


Ser 


Gly Pro Pro Gly Pro Cys 


His 


Lys Leu 


Pro 


Pro 


Ala Pro 


Ala 


465 




470 




475 








480 


Trp 


His 


His Phe Pro Pro Arg Leu 


Cys 


Trp Thr 


Trp Ala Cys Ala 


Gly 






485 




4 90 






495 




Leu 


Arg 


Asp Ala His Glu Glu Asn 


Pro 


Glu Ser 


He 


Leu Asp Glu 


His 






" 500 


505 








510 
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Val 


Gin 


Arg 


val 

• 


Leu Arg Thr Thr Gly 


Arg Gin 


Ser 


Pro Gly 


Pro 


Gly 






CI c 




520 






525 




HlS 


Arg 


Ser 


f""l mn A 

Pro 


Asp Ser Gly His Val 


Ala Lys 


Met 


Pro Val 


Ala 


Leu 




con 






535 




540 








Gly 


Gly 


Ala 


Ala 


Ser Gly His Gly Lys 


His Val 


Pro 


Lys Ser Gly 


Ala 










ceo 
550 


555 








560 


Lys 


Leu 


Asp 


Ala 


7\ 1 _ A., T it • „ ft- 

Ala Gly Leu His His 


His Arg His 


Val His 


His 


m m > 

His 












570 






m+ 0m mjm 

575 




Val 


HIS 


His 


Ser 


Thr Ala Arg Pro Lys 


Glu Gin 


Val 


Glu Ala 


Glu 


Ala 








580 


585 






590 






Thr 


Arg 


Arg 


Ala 


Gin Ser Ser Phe Ala 


Trp Gly Leu 


Glu Pro 


• * * 

His 


Ser 






C ft c 

595 




600 






605 






His 


Gly 


Ala 


Arg 


Ser Arg Gly Tyr Ser 


Glu Ser 


Val 


Gly Ala 


Ala 


Pro 




01 U 










620 








Asn 


Ala 


Ser 


Asp 


Gly Leu Ala His Ser 


Gly Lys 


Val 


Gly Val 


Ala 


Cys 










coo 
630 


635 








640 


Lys 


Arg 


Asn 


Ala 


Lys Lys Ala Glu Ser 


Gly Lys 


Ser 


Ala Ser 


Thr 


Glu 










64 5 


650 






mm mm 

655 




val 


Pro 


Gly 


Ala 


Ser Glu Asp Ala Glu 


Lys Asn 


Gin 


Lys He 


Met 


Gin 








/" e f\ 
660 


665 






670 






Trp 


lie 


lie 


Glu 


Gly Glu Lys Glu lie 


Ser Arg 


His 


Arg Arg 


Thr 


Gly 






675 




680 






685 




His 


Gly 


Ser 


Ser 


Gly Thr Arg Lys Pro 


Gin Pro 


His 


Glu Asn 


Ser 


Arg 




©so 










700 






Pro 


Leu 


Ser 


T 

Leu 


Glu His Pro Trp Ala 


Gly Pro 


Gin 


Leu Arg 


Thr 


Ser 


/Uj 








*7 1 O 
/ 1U 


715 








720 


Val 


Gin 


Pro 


Ser 


His Leu Phe lie Gin 


Asp Pro 


Thr 


Met Pro 


Pro 


• • * 

His 










725 


730 






735 




Pro 


Ala 


Pro 


Asn 


Pro Leu Thr Gin Leu 


Glu Glu 


Ala 


Arg Arg 


Arg 


Leu 








74 0 


74 5 






750 






Glu 


Glu 


Glu 


Glu 


Lys Arg Ala Ser Arg 


Ala Pro 


Ser 


Lys Gin Arg 


Tyr 






T C C 




O C f\ 

760 






765 






Val 


Gin 


Glu 


Val 


Met Arg Arg Gly Arg 


Ala Cys Val 


Arg Pro 


Ala 


Cys 




flu 






775 




780 








Ala 


Pro 


Val 


T — - - 

Leu 


His Val Val Pro Ala 


Val Ser 


Asp 


Met Glu 


Leu 


Ser 


/OD 








TOO 


795 








O f\ ft 

800 


Glu 


Thr 


Glu 


Thr 


Arg Ser Gin Arg Lys 


Val Gly Gly 


Gly Ser 


Ala 


Gin 










one 
DUO 


810 






815 




Pro 


Cys 


Asp 


Ser 


lie Val Val Ala Tyr 


Tyr Phe 


Cys 


Gly Glu 


Pro 


He 
















830 






Pro 


Tyr 


Arg 


Thr 


Leu Val Arg Gly Arg 


Ala Val 


Thr 


Leu Gly Gin 


Phe 






QIC 

DO J 




o40 






845 






Lys 


Glu 


Leu 


Leu 


Thr Lys Lys Gly Ser 


Tyr Arg 


Tyr 


Tyr Phe 


Lys 


Lys 




850 






855 




860 








Val 


Ser 


Asp 


Glu 


Phe Asp Cys Gly Val 


Val Phe 


Glu 


Glu Val 


Arg 


Glu 


865 








870 


875 








880 


Asp 


Glu 


Ala 


Val 


Leu Pro Val Phe Glu 


Glu Lys 


He 


He Gly 


Lys 


Val 










885 


890 






895 




Glu 


Lys 


Val 


Asp 




















900 
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Met 


Val 


Pro 


Val 


Ala 


Val 


Thr 


Ala Ala 


Val Ala Prn \/al 


T ^m. _ _ J0 m K 

Leu ber 


He 


1 








5 








10 


15 




Asn 


Ser 


Asp 


Phe 


Ser Asp Leu 


Ara Glu 


He Lys Lys Gin 


Leu Leu 


Leu 








20 








25 




30 




He 


Ala 


Gly Leu 


Thr Arg Glu 


Ara Glv 


Leu Leu His Ser 


Ser Lys 


Trp 






35 










40 


45 


Ser 


Ala 


Glu 


Leu 


Ala 


Phe 


Ser 


Leu Pro 


Ala Leu Pro Leu 


Ala Glu 


Leu 




50 










55 




60 






Gin 


Pro 


Pro 


Pro 


Pro 


He 


Thr 


Glu Glu 


Asp Ala Gin Asp Met Asp Ala 


65 










70 






75 




80 


Tvr 


Thr 


Leu 


Ala 


Lys Ala 


Tyr 


Phe A*5D 

l lie noy 


Val Lys Glu Tyr Asp Arg Ala 










85 








90 


95 . 




Ala 


His 


Phe 


Leu 


His Gly Cys 


Asn Ser 


Lys Lys Ala Tyr 


Phe Leu 


Tyr 








100 








105 

m» mr mr 




110 


Met 


Tyr 


Ser Arg 


Tyr 


Leu 


Ser 


Glv Glu 


Lys Lys Lys Asp Asp Glu Thr 






115 










120 

J* mm V/ 


125 






Val 


Asp 


Ser 


Leu 


Gly 


Pro 


Leu 


Glu Lvs 


Gly Gin Val Lys 


Asn Glu 


Ala 




130 










135 




140 






Leu 


Arg 


Glu 


Leu 


Arg Val 


Glu 


Leu Ser 


Lys Lys His Gin Ala Arg Glu 


145 










150 






155 




160 


Leu Asp 


Gly Phe 


Gly Leu Tyr 


Leu Tvr 


Gly Val Val Leu Arg Lys 


Leu 










165 








170 


175 




Asp Leu 


Val 


Lys 


Glu 


Ala 


He 


Asd Val 


Phe Val Glu Ala 


Thr His 


Val 








180 








185 

•Am W mf 




190 




Leu 


Pro 


Leu 


His 


Trp Gly Ala 


Trn Leu 


Glu Leu Cys Asn 


Leu He 


Thr 






195 










200 

m^ mr \mf 


205 






Asp 


Lys 


Glu 


Met 


Leu Lys 


Phe 


Leu Ser 


Leu Pro Asp Thr 


Trp Met 


Lys 




210 










215 




220 


Glu 


Phe 


Phe 


Leu 


Ala 


His 


He 


Tvr Thr 


Glu Leu Gin Leu 


He Glu 


Glu 


225 










230 






235 




240 


Ala 


Leu 


Gin 


Lys 


Tyr 


Gin 


Asn 


Leu Tie 


Asp Val Gly Phe 


Ser Lys 


Ser 










245 








250 


255 




Ser 


Tyr 


He 


Val 


Ser 


Gin 


He 


Ala Val 


Ala Tyr His Asn 


He Arg Asp 








260 








265 

mm Km/ w 




270 




He 


Asp 


Lys 


Ala 


Leu 


Ser 


He 


Phe A^n 


Glu Leu Arg Lys 


Gin Asp 


Pro 






275 










280 


285 




Tyr Arg 


He 


Glu 


Asn 


Met 


Asp 


Thr Phe 

mm m + J*t 4 11 \m~ 


Ser Asn Leu Leu 


Tyr Val 


Arg 




290 










295 


* 


300 


Ser 


Met 


Lys 


Ser 


Glu 


Leu 


Ser 


Tvr Leu 

* Jr jl jlj w U 


Ala His Asn Leu 


Cys Glu 


He 


305 










310 






315 


320 


Asp Lys 


Tyr Arg 


Val 


Glu 


Thr 


Cvs Cvs 

mm* J mm* mm* J \m* 


Val He Gly Asn 


Tyr Tyr 


Ser 










325 








330 


335 




Leu v Arg 


Ser 


Gin 


His 


Glu 


Lys 


Ala Ala 

* ±-mm %mA m Aa^W *^ 


Leu Tyr Phe Gin 


Arg Ala 


Leu 








340 








345 




350 




Lys 


Leu 


Asn 


Pro 


Arg 


Tyr 


Leu 


Glv Ala 


Trp Thr Leu Met 


Gly His 


Glu 






355 










360 

mm? \0 \W 


365 




Tyr Met 


Glu 


Met 


Lys Asn 


Thr 


Ser Ala 

w J» JT* J. CI 


Ala He Gin Ala 


Tyr Arg 


His 




370 










375 




380 




Ala 


He 


Glu 


Val 


Asn Lys 


Arg 


A«?n Tvr 


Arg Ala Trp Tyr Gly Leu Gly 


385 










390 






395 




400 


Gin 


Thr 


Tyr 


Glu 


He 


Leu 


Lys 


Met Pro 


Phe Tyr Cys Leu 


Tyr Tyr 


Tyr 










405 








410 


415 




Arg Arg 


Ala 


His 


Gin 


Leu Arg 


Pro Asn 


Asp Ser Arg Met 


Leu Val 


Ala 








420 








425 


* 


430 




Leu 


Gly 


Glu 


Cys 


Tyr 


Glu 


Lys 


Leu Asn 


Gin Leu Val Glu 


Ala Lys 


Lys 






435 










440 


445 




Cys 


Tyr 


Trp Arg 


Ala 


Tyr Ala 


Val Gly 


Asp Val Glu Lys 


Met. Ala 


Leu 
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450 










4 

*i j j 














Val 


Lvs 


Leu 


Ala 


JJ jf o 


JjC LI 


Hi q 
n j. s 


\j±m K3±n j^eu 


inr biu 


Ser 


Glu 


Gin 


Ala 


465 










470 






475 










Ala 


Gin 


Cvs 




lie 

485 


Lys 


Tyr 


He Gin Asp 
490 


He Tyr 


Ser 


Cys 


Gly 
4 95 


Glu 


lie 


Val 


Glu 


His 
500 


Leu 


Glu 


Glu 


Ser Thr Ala 
505 


Phe Arg 


Tyr 


Leu 
510 


Ala 




T vr 


Tvr 


Phe 


Lvs 

u y «j 


Cvs 


Lys 


Leu 


Trp Asp Glu 


Ala Ser 


Thr Cys Ala 








515 










520 




525 














nld 


irne 


Asn Asp Thr Arg Glu Glu Gly 


Lys Ala Leu 


Leu 




530 










535 




540 










Arg 


Gin 


He 


Leu 


Gin 


Leu Arg Asn Gin Gly Glu Thr 


Pro 


Thr 


Thr 


Glu 


545 










550 






555 








560 


Val 


Pro 


Ala 


Pro 


Phe 
565 


Phe 


Leu 


Pro Ala Ser 
570 


Leu Ser 


Ala 


Asn 


Asn 
575 


Thr 


Pro 


Thr 


Arg 


Arg 
580 


Val 


Ser 


Pro 


Leu Asn Leu 
585 


Ser Ser 


Val 


Thr 
590 


Pro 





<210> 93 
<211> 914 
<212> PRT 

<213> Homd sapiens 
<400> 93 



Val 


Tvr 


Gin 


Val 


Leu 


Leu 


Val Gly Ser Thr Leu Leu 


Lys Glu 


Val 


tr J. O 


1 








5 






10 






15 




Ser 


Gly 


Leu 


Gin 


Leu 


Glu 


Gin Leu 


Pro Ser Gin 


Ser 


Leu Leu 


Thr 


His 








20 








25 




30 






He 


Pro 


Thr 


Ala 


Gly 


Leu 


Pro Thr 


Sef Leu Gly Gly 


Gly Leu 


Pro 


Tyr 






35 








40 






45 




Cys 


His 


Gin 


Ala 


Trp 


Leu 


Asp Phe 


Arg Arg Arg 


Leu 


Glu Ala 


Leu 


Leu 




50 










55 




60 








Gin 


Asn 


Cys 


Gin 


Ala 


Ala Cys Ala 


Leu Leu Gin Gly 


Ala He 


Glu 


Ser 


65 










70 




75 








80 


Val 


Lys 


Ala 


Val 


Pro 


Gin 


Pro Met 


Glu Pro Gly Glu 


Val Gly Gin 


Leu 










85 






90 






95 




Leu 


Gin 


Gin 


Thr 


Glu 


Val 


Leu Met 


Gin Gin Val 


Leu 


Asp Ser 


■ Pro 


Trp 








100 








105 




110 




Leu 


Ala 


Trp 


Leu 


Gin 


Cys Gin Gly Gly Arg Glu Leu 


Thr Trp 


Leu 


Lys 






115 








120 






125 






Gin 


Glu 


Val 


Pro 


Glu 


Val 


Thr Leu 


Ser Pro Asp Tyr 


Arg Thr 


Ala 


Met 




130 










135 




140 








Asp 


Lys 


Ala 


Asp 


Glu 


Leu 


Tyr Asp Arg Val Asp 


Gly 


Leu Leu 


His 


Gin 


145 










150 




155 








160 


Leu 


Thr 


Leu 


Gin 


Ser 


Asn 


Gin Arg 


He Gin Ala 


Leu 


Glu Leu 


Val 


Gin 










165 






170 






175 




Thr 


Leu 


Glu 


Ala 


Arg 


Glu 


Ser Gly 


Leu His Gin 


He 


Glu Val 


Trp 


Leu 








180 








185 




190 




Gin 


Gin 


Val 


Gly 


Trp 


Pro 


Ala Leu 


Glu Glu Ala 


Gly 


Glu Pro 


Ser 


Leu 






195 








200 






205 






Asp 


Met 


Leu 


Leu 


Gin 


Ala 


Gin Gly 


Ser Phe Gin 


Glu 


Leu Tyr 


Gin 


Val 




210 










215 




220 






Ala 


Gin 


Glu 


Gin 


Val 


Arg Gin Gly Glu Lys Phe 


Leu 


Gin Pro 


Leu 


Thr 


225 










230 




235 








240 


Gly 


Trp 


Glu 


Ala 


Ala 


Glu 


Leu Asp 


Pro Pro Gly Ala 


Arg Phe 


Leu 


Ala 










245 






250 






255 




Leu 


Arg 


Ala 


Gin 


Leu 


Thr 


Glu Phe 


Ser Arg Ala 


Leu 


Ala Gin 


Arg 


Cys 
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260 



vjj.ii /ax y 




Ala 












Al a 
/ax a 








m n A *"o 
ulu /A J. y 


t to 


Cl v/ 








nis fro 


ASp 


L»eu 


Thr Gly 


Leu 


Gly 






340 




uys 


bin 










T we 
Jjy5 


LcU 


nn 






nin Val 


P Y~o 

nu 


Ala 


JO J 






true t\s>p 


7A V rf~T 

/Ax y 


A en 


His His 


Ala 


Ala 






420 


C 1 v Ala 


T All 


c XTO 










/Ax y 


OCX 


ft jU 






Ala Thr 
nxa nu 


ft! n 


A T*<T 

ni y 








i yr irne 




ulu 


Gin Arg 


Ala 


His 






500 


P\/<j Hie 


rne 


true 








/ax. y vciJL 


AT a 


iyr 








Ala Toti 


iyr 


oer 


5 






Tvr ft! v 


His 


1 11X 


Leu Asp 


Leu 


Ala 






580 


Tvr Ala 


Leu 


Leu 




595 

-J mJ -J 




Gin Glu 

Will VJ X u 


Leu 

VI. u 










T.pii A t rr 
ucu /ax y 


His 


Gl v 


625 






A*?o Va 1 


A^n 


T.pu 

JJC u 


Val Val 


Arg 


Thr 






660 


Glu Glu 


Leu 


Leu 




675 




Asp Thr 


Phe 


Ala 


690 






Thr Glu 


Cys 


Cys 


705 







Asp Ala 


Glu Arg 








280 


Glu 


Glu 


Gly Gin 






295 




Val 


Val 


Leu 


Gin 




310 






Pro 


Pro 


Ala 


His 


325 








Ser 


Glu 


Ala 


He 


Asp Thr 


Trp Leu 








360 


Pro 


Pro 


Val 


Gly 






375 




Ala 


Pro 


Ala 


His 




390 






Leu Gly 


Gin 


Ser 


405 








Thr 


He 


Ala 


Ala 


Gin 


Ala 


Ser 


Pro 








440 


Leu 


Asn 


Arg Leu 






455 




Glu 


Tyr 


Val 


Arg 




470 






Leu Asp 


Arg 


Pro 


485 








Leu 


Phe 


Gly Asn 


Leu Arg 


Glu 


Leu 








520 


Ala 


Phe 


Leu 


Arg 






535 




Lys 


Asn 


Lys 


Pro 




550 






Phe 


Phe 


Lys 


Asp 


565 








Ser 


Tyr 


Leu 


Leu 


Leu 


Gin 


Glu 


Leu 








600 


Ala 


Leu 


Arg 


Glu 






615 




Asn Asp 


Leu 


Leu 




630 






Lys 


Glu 


Gin Gly 


645 








Gly Arg 


His 


Lys 


Leu 


Phe 


Ser 


Lys 








680 


Tyr 


Lys 


Arg 


Ser 






695 




Gly Asn 


Ser 


Asn 




710 







265 

Leu Phe Gin Leu 

Arg Val Leu Ala 

300 

Gin Leu Gin Leu 
315 

Phe Arg Lys Met 
330 

Arg Gin Glu Cys 
345 

Ala Leu Asp Gin 

Ser Thr Ala Ser 

380 

Pro Pro Leu Arg 
395 

Leu Ser Glu Pro 
410 

Cys Arg Arg Pro 
425 

Thr Val Pro Pro 

Gin Leu Val Leu 

460 

Ala Leu Glu Tyr 
475 

Asp Val Pro Gin 
490. 

Leu Glu Lys Leu 
505 

Glu Ala Cys Thr 

His Arg Val Gin 

540 

Arg Ser Asp Ala 
555 

Lys Gin Gin Ala 
570 

Lys Pro He Gin 
585 

Ala Arg Ala Cys 

Ala Gin Ser Leu 

620 

Ala Met Asp Ala 
635 

Gin Leu Val Arg 
650 

Ser Val Arg Arg 
665 

Pro Arg His Gly 

Phe Lys Met Ala 

700 

Leu Arg Phe Glu 
715 



270 

Phe Arg Glu Ala 
285 

Glu Leu Glu Gin 

His Trp Thr Arg 

320 

Trp Ala Leu Ala 
335 

Arg Trp Ala Trp 
350 

Lys Leu Glu Ala 
365 

Leu Cys Val Ser 

Lys Ala Tyr Ser 

400 

Ala Cys His Cys 
415 

Glu Ala Gly Gly 
430 

Pro Gly Ser Ser 
445 

Ala Glu Met Val 

Thr Met Glu Asn 

480 

Gly Leu Arg Gly 
4 95 

Arg Asp Phe His 
510 

Arg His Pro Pro 
525 

Phe Gly Met Tyr 

Leu Met Ser Ser 

560 

Leu Gly Asp His 
575 

Arg Met Gly Lys 
590 

Gly Gly Pro Thr 
605 

Val His Phe Gin 

He Gin Gly Cys 

640 

Gin Asp Glu Phe 
655 

He Phe Leu Phe 
670 

Pro Thr Gly Val 
685 

Asp Leu Gly Leu 

He Trp Phe Arg 

720 
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Lvs 


Ala Arg Asp Thr Phe 






725 




Tip Lvs 


Gin 


Ala Trp Thr Ala Asp 






740 




Ala Val 


His 


Asn Lvs 


Glu Val Ara 




755 




760 


Val Glv 


Asn 


Lvs Ala 


Phe Ara Asd 


770 






775 


noil ^ 


Am 


Thr Val 


Asn Tyr Val 


785 






790 


A Tfr A 1 a 


Car 


He Ala 


Val Ala Pro 






805 




r= 1 v Ala 


Car 
OCX. 


Asn Ser 


Leu Pro Gly 






820 




/^l \/ Coy 
VjJLy OCX, 




Asn Leu 


His Leu Tyr 




835 

U <J •J 




840 


m y jr 


Pro 


Leu Tyr 


Pro Ser Phe 


850 

w w V 






855 


Glu Ala 


Glu 


Leu Gly Gly Gin Pro 


8 65 






870 


lie Ser 


Ser 


Gin Cys 


Pro Ser Ala 






885 




Ser Cys 


Val 


Ser Gly Gin Ala Leu 






900 




Cys Val 









<210> 94 
<211> 277 
<212> PRT 

<213> Homo sapiens 



<400> 94 



Leu 


Asn 


Tyr 


Leu 


Leu Glu Ser Arg 


1 








5 


Leu 


Lys 


Gin 


Gly 


He Ala Thr Ala 








20 




Pro 


Ala 


Val 


Leu 


Thr Val Val Thr 






35 




40 


Gly 


His 


His 


Thr 


Asp Gly Cys Leu 




50 






55 


Thr 


Gly 


Thr 


Gly 


Gly Ala Phe Trp 


65 








70 


His 


Ala 


Val 


Asp 


Val Val Leu His 










85 


Gin 


Ala 


Leu 


He 


Ala His Gly Ala 








100 




Leu 


Ala 


Gin 


Gly 


Leu Gin Asp His 




• 


115 




120 


Ala 


Ala 


Phe 


Val 


Gly Arg Leu Leu 




130 






135 


Leu 


Ala 


Val 


His 


Phe Leu Thr His 


145 








150 


Ser 


Ser 


Pro 


Thr 


Arg Val Ala Gly 










165 


Pro 


His 


Cys 


Leu 


Ala Arg Leu Leu 



180 
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Va JL 


beu 


Uill 


rlxa 


oer 


ser 


Leu 


Aia 




730 










1 J D 




Tie 


OC L 


His 


Leu 


1j6U 


i rp 


7\ •v** 

Arg 


bin 


74 *i 










/ DU 






net 


nio 


Glu 


Met 


vai 


C W» 

ber 


Met 


Gly 










* DO 






lie 


r\X a 


Pro 


Ser 


Pill 


bill 


TV 1 ^ 

Ala 


T 1 a 

lie 








780 










T A 1 1 

Xicu 


T \r © 

i#ys 


Cys Arg 


blU 


vai 


Arg 


O ^-v w« 

Ser 






795 










O Art 

800 


rne 


M.Sp 


His 


Asp 


oer 


Leu 


Tyr 


Leu 




ft i n 

0 x U 










QIC 




/iSp 


D v /-\ 

rro 


Ala 


Ser 


bys 


ber 


val 


Leu 


ft 

0 c. D 










0 jU 






7\ v- t~i 

/vrg 


Asp 


Pro 


Ala 


Leu 


Leu 


Gly 


Leu 










845 








Leu 


Glu 


Glu 


Ala 


Ala 


Leu 


Glu 


Ala 








860 










Ser 


Leu 


Thr 


Ala 


Glu 


Asp 


Ser 


Glu 






875 










880 


Ser 


Gly Ser 


Ser 


Gly 


Ser 


Asp 


Ser 




890 










895 




Gly 


Arg Gly Leu 


Glu Asp 


Leu 


Pro 


905 










910 






T OH 


Glu 


Ala 


Ala 


Ala 


His 


uys 


Aia 




10 










1j 




Car* 
OCX 


Leu 


Leu 


Pro 


Ala 


Gin 


Leu 


bin 












30 






uys 


His 


Val 


Val 


Val 


Ser 


vai 


nlS 










45 








r\±a 


Ala 


Leu 


Cys 


Arg 


Glu 


MSp 


Arg 








60 












Lys Asn Arg 


Val 


He 


V ax 


Q a r* 

OCX 






75 










ft 0 


Va 1 
vox 


His 


Gly Glu 


Gly Asn 


trx vj 


VOX 




90 














Pro 

t x U 


Glu 


Ala 


Ala 


Trp Val 


vdx 


bxy 


i ns 










110 






tr lie 


His 


Asp 


Glu 


Met 


Ser 


x nr 


Hie 
MIS 


• 








125 








Glu 


Pro Gly Val 


Gin 


Glu 


Val 


Leu 








140 










Val 


Val 


Glu 


Arg 


Leu 


Pro 


Thr 


Glu 






155 










160 


Glu 


Ala 


Val 


Ser 


Val 


He 


Lys 


Thr 




170 










175 




Gly 


Ser 


Val 


Asp 


Ala 


Lys 


Pro 


Thr 


185 










190 
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Leu Asp Ala Asn Ala Glu Val Val Pro Arg Arg Ala Arg Leu Glu Arg 

195 200 205 

Pro Leu Gin Leu Pro Gly Glu Arg Leu Gin Pro Pro Leu Gly Arg Ala 

210 215 220 

Trp Ala Ala Leu Pro Ala Arg Gly Gin Arg Glu Cys Arg Gin Arg Glu 
225 230 235 240 

Gly Gly Arg Pro Arg Arg Leu Arg Gly Ala Ser Gly Arg Gly Ala Gly 

245 250 255 

Ala Gly Arg Glu Glu Val Ser Val Gly Phe Ser Ala Gin Trp Glu Phe 

260 265 270 

Gly Ser Gly Arg His 
275 

<210> 95 

<211> 1120 

<212> PRT 

<213> Homo sapiens 



<400> 95 



Met 




Aro 


Val 


TiVQ T.v^ T.pn 


Ser 


Leu 


Ser 


Leu 


Ser 


Pro 


Ser 


Pro 


Gin 


1 














10 










15 




Thr 


Glv 




Pro 


Car Mpf Zirrr 


Thr 


Pro 


Leu Arg Glu 


Leu 


Thr 


Leu 


Gin 








20 






25 


• 








30 






Pro 


Glv 


Ala 


Lpu 

Si U 


Thr Thr 9pr 

1111. 1 111. W CL. 


Gly Lys Arg Ser Pro Ala Cys 


Ser 


Ser 






35 






40 










45 








Leu 

<4J wh 


Thr 


Pro 


Ser 


Tipu T.v^ 

JUwU y o liy d 


Leu Gly Leu Gin Glu 


Gly Ser 


Asn 


Asn 




50 
















60 










Ser 

>w» Jm 


Ser 


Pro 


Val 


A*?n Php Va 1 


Asn 


Asn 


Lys Arg Thr Asp Leu 


Ser 


Ser 


65 








70 








75 










80 


Glu 


His 


Phe 


Ser 


His Spr Spr 


Lys 


Trp 


Leu 


Glu 


Thr Cys 


Gin 


His 


Glu 










O J 






90 










95 




Ser 


Asp 


Glu 


Gin 


Pro Leu Asp 


Pro 


He 


Pro 


Gin 


He 


Ser 


Ser 


Thr 


Pro 








100 






105 










110 






Lys 


Thr 


Ser 


Glu 


Glu Ala Val 


Asp 


Pro 


Leu Gly Asn Tyr Met 


Val 


Lys 






115 






120 










125 






Thr 


He 


Val 


Leu 


Val Pro Ser 


Pro 


Leu Gly Gin Gin Gin Asp 


Met 


He 




130 






135 










140 










Phe 


Glu 


Ala 


Arg 


Leu Asp Thr 


Met 


Ala 


Glu 


Thr 


Asn 


Ser 


He 


Ser 


Leu 


145 








150 








155 










160 


Asn 


Gly 


Pro 


Leu 


Arg Thr Asp 


Asp 


Leu 


Val 


Arg 


Glu 


Glu 


Val 


Ala 


Pro 










165 






170 










175 




Cys 


Met 


Gly 


Asp 


Arg Phe Ser 


Glu 


Val 


Ala 


Ala 


Val 


Ser 


Glu 


Lys 


Pro 








180 






185 










190 




He 


Phe 


Gin 


Glu 


Ser Pro Ser 


His 


Leu 


Leu 


Glu 


Glu 


Ser 


Pro 


Pro 


Asn 






195 






200 










205 








Pro 


Cys 


Ser 


Glu 


Gin Leu His 


Cys 


Ser 


Lys 


Glu 


Ser 


Leu 


Ser 


Ser Arg 




210 






215 










220 










Thr 


Glu 


Ala 


Val 


Arg Glu Asp 


Leu 


Val 


Pro 


Ser 


Glu 


Ser 


Asn 


Ala 


Phe 


225 








230 








235 










240 


Leu 


Pro 


Ser 


Ser 


Val Leu Trp 


Leu 


Ser 


Pro 


Ser 


Thr 


Ala 


Leu 


Ala 


Ala 










245 






250 










255 




Asp 


Phe 


Arg 


Val 


Asn His Val 


Asp 


Pro 


Glu 


Glu 


Glu 


He 


Val 


Glu 


His 








260 






265 










270 






Gly 


Ala 


Met 


Glu 


Glu Arg Glu 


Met 


Arg 


Phe 


Pro 


Thr 


His 


Pro 


Lys 


Glu 






275 






280 










285 






Ser 


Glu 


Thr 


Glu 


Asp Gin Ala 


Leu 


Val 


Ser 


Ser 


Val 


Glu Asp 


He 


Leu 




290 






295 










300 
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Ser Thr Cvs 


Leu 


Thr 


Pro 


Asn Leu Val 


Glu Met 


Glu 


Ser 


Gin 


Glu 


Ala 


305 






310 




315 










320 


Pro Glv Pro 


Ala 


Val 


Glu Asp Val Gly Arg He Leu Gly Ser Asp Thr 






325 














335 




Glu Ser TrD 


Met 

4> W 


Ser 


Pro 


Leu Ala Trp 


T.oi i n 
JjtSU \J1U 


T \/e 


Gly Val Asn Thr 




340 






345 








350 






Ser Val Met 


Leu 


Glu 


Asn Leu Arg Gin 


Cot* Toil 


oer 


iieu 


Pro 


Ser 


Met 


355 








360 






J bo 








Leu Arcr Asd 


Ala 


Ala 


He 


Gly Thr Thr 


Dm Dho 


oer 


Thr 


Cys 


Ser 


Val 


370 








375 




JoU 








Glv Thr Tro 


Phe 


Thr 


Pro 


Ser Ala Pro 


Gin Glu 


Lys 


Ser 


Thr 


Asn 


Thr 


385 

^-r w 






390 




395 








400 


Ser Gin Thr 

iJ C JL VJJ.ll Xlli. 


Gly Leu 


Val 


Gly Thr Lys 


His Ser 


Thr 


Ser 


Glu 


Thr 


Glu 






405 






410 * 








415 




Gin Leu Leu 


Cys 


Gly 


Arg 


Pro Pro Asp 


Leu Thr 


Ala 


Leu 


Ser Arg His 




420 






425 








430 






Asd Leu Glu 


Asp Asn 


Leu 


Leu Ser Ser 


Leu Val 


lie 


Val 


Glu 


Phe 


Leu 


435 
^ ^ ^ 




m 




440 






445 








Ser Ara Gin 


Leu Arg 


Asp 


Trp Lys Ser 


Gin Leu 


Ala 


Val 


Pro 


His 


Pro 


450 








455 




460 










Glu Thr Gin 


Asp 


Ser 


Ser Thr Gin Thr Asp Thr Ser His 


Ser 


Gly 


He 


465 






470 




475 








480 


Thr Asn Lvs 


Leu 


Gin 


His 


Leu Lys Glu 


Ser His 


Glu 


Met 


Gly Gin Ala 






485 






490 








495 




Leu Gin Gin 


Ala Arg 


Asn 


Val Met Gin 


Ser Trp Val 


Leu 


He 


Ser 


Lys 




500 






505 








510 




Glu Leu lie 


Ser 


Leu 


Leu 


His Leu Ser 


Leu Leu 


His 


Leu 


Glu 


Glu 


Asp 


515 








520 






525 






Lvs Thr Thr 


Val 


Asn 


Gin 


Glu Ser Arg Arg Ala Glu Thr 


Leu 


Val 


Cys 


530 








535 




540 








Cvs Cvs Phe 


Asp 


Leu 


Leu 


Lys Lys Leu Arg Ala 


Lys 


Leu 


Gin 


Ser 


Leu 


545 






550 




555 










560 


Lvs Ala Glu 


Arg Glu 


Glu 


Ala Arg His 


Arg Glu 


Glu 


Met 


Ala Leu Arg 






565 






570 








575 




Glv Lv<? A^o 


Ala 


Ala 


Glu 


He Val Leu 


Glu Ala 


Phe 


Cys 


Ala 


His 


Ala 




580 






585 






590 






Ser Gin Ara 


He 


Ser 


Gin Leu Glu Gin Asp Leu Ala 


Ser 


Met 


Arg 


Glu 


595 








600 






605 








Phe Ara Glv 


Leu 


Leu 


Lys Asp Ala Gin 


Thr Gin 


Leu 


Val 


Gly Leu 


His 


610 








615 




620 










Ala Lvs Gin 


Glu 


Glu 


Leu 


Val Gin Gin 


Thr Val 


Ser 


Leu 


Thr 


Ser 


Thr 


625 






630 




635 










640 


Leu Gin Gin 


Asp 


Trp 


Arg 


Ser Met Gin 


Leu Asp Tyr Thr 


Thr 


Trp 


Thr 






645 






650 








655 




Ala Leu Leu 


Ser Arg 


Ser Arg Gin Leu 


Thr Glu 


Lys 


Leu 


Thr 


Val 


Lys 




660 






665 








670 




Ser Gin Gin 


Ala 


Leu 


Gin Glu Arg Asp Val Ala 


He 


Glu 


Glu 


Lys 


Gin 


675 








680 






685 






Glu Val Ser 


Arg 


Val 


Leu 


Glu Gin Val 


Ser Ala 


Gin 


Leu 


Glu Glu Cys 


690 

w ^ V/ 








695 




700 










Lys Gly Gin 


Thr 


Glu 


Gin 


Leu Glu Leu 


Glu Asn 


He Arg 


Leu 


Ala 


Thr 


705 






710 




715 










720 


Asp Leu Arg 


Ala 


Gin 


Leu 


Gin He Leu 


Ala Asn 


Met 


Asp 


Ser 


Gin 


Leu 






725 






730 








735 




Lys Glu Leu 


Gin 


Ser 


Gin 


His Thr His 


Cys Ala 


Gin 


Asp 


Leu 


Ala 


Met 




740 






745 








750 






Lys Asp Glu 


Leu 


Leu 


Cys 


Gin Leu Thr 


Gin Ser 


Asn 


Glu 


Glu 


Gin 


Ala 
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755 760 765 

Ala Gin Cys Val Lys Glu Glu Met Ala Leu Lys His Met Gin Ala Glu 

770 775 780 

Leu Gin Gin Gin Gin Ala Val Leu Ala Lys Glu Val Arg Asp Leu Lys 
785 790 795 800 

Glu Thr Leu Glu Phe Ala Asp Gin Glu Asn Gin Val Ala His Leu Glu 

805 810 815 

Leu Gly Gin Val Glu Cys Gin Leu Lys Thr Thr Leu Glu Val Leu Arg 

820 825 830 

Glu Arg Ser Leu Gin Cys Glu Asn Leu Lys Asp Thr Val Glu Asn Leu 

835 840 845 

Thr Ala Lys Leu Ala Ser Thr He Ala Asp Asn Gin Glu Gin Asp Leu 

850 855 860 

Glu Lys Thr Arg Gin Tyr Ser Gin Lys Leu Gly Leu Leu Thr Glu Gin 
865 870 875 880 

Leu Gin Ser Leu Thr Leu Phe Leu Gin Thr Lys Leu Lys Glu Lys Thr 

885 890 895 

Glu Gin. Glu Thr Leu Leu Leu Ser Thr Ala Cys Pro Pro Thr Gin Glu 

900 905 910 

His Pro Leu Pro Asn Asp Arg Thr Phe Leu Gly Ser He Leu Thr Ala 

915 920 925 

Val Ala Asp Glu Glu Pro Glu Ser Thr Pro Val Pro Leu Leu Gly Ser 

930 935 940 

Asp Lys Ser Ala Phe Thr Arg Val Ala Ser Met Val Ser Leu Gin Pro 
945 950 955 960 

Ala Glu Thr Pro Gly Met Glu Glu Ser Leu Ala Glu Met Ser He Met 

965 970 975 

Thr Thr Glu Leu Gin Ser Leu Cys Ser Leu Leu Gin Glu Ser Lys Glu 

980 985 990 

Glu Ala He Arg Thr Leu Gin Arg Lys lie Cys Glu Leu Gin Ala Arg 

995 1000 1005 

Leu Gin Ala Gin Glu Glu Gin His Gin Glu Val Gin Lys Ala Lys Glu 

1010 1015 1020 

Ala Asp He Glu Lys Leu Asn Gin Ala Leu Cys Leu Arg Tyr Lys Asn 
1025 1030 1035 1040 

Glu Lys Glu Leu Gin Glu Val He Gin Gin Asn Glu Lys He Leu Glu 

1045 1050 1055 

Gin He Asp Lys Ser Gly Glu Leu He Ser Leu Arg Glu Glu Val Thr 

1060 1065 * 1070 

His Leu Thr Arg Ser Leu Arg Arg Ala Glu Thr Glu Thr Lys Val Leu 

1075 1080 1085 

Gin Glu Ala Trp Gin Ala Ser Trp Thr Pro Thr Ala Ser Leu Trp Pro 

1090 1095 1100 

Pro lie Gly Ser Arg Arg Lys Cys Gly Ser Leu Arg Arg Trp Thr Asn 
1105 1110 1115 1120 

<210> 96 

<211> 540 

<212> PRT 

<213> Homo sapiens 

<400> 96 

Met Gly Thr Thr Ala Arg Ala Ala Leu Val Leu Thr Tyr Leu Ala Val 

1 5 10 15 

Ala Ser Ala Ala Ser Glu Gly Gly Phe Thr Ala Thr Gly Gin Arg Gin 

20 25 30 

Leu Arg Pro Glu His Phe Gin Glu Val Gly Tyr Ala Ala Pro Pro Ser 
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35 






40 








45 


Pro Pro 


Leu 


Ser 


Arg 


Ser Leu Pro Met 


Asp 


His 


Pro 


Asd Ser Ser Gin 


50 








55 






60 


His Gly 


Pro 


Pro 


Phe 


Glu Gly Gin Ser 


Gin 


Val 


Gin 


Pro Pro Pro Ser 


65 








70 




75 




80 


Gin Glu 


Ala 


Thr 


Pro 


Leu Gin Gin Glu 


Lys 


Leu 


Leu 


Pro Ala Gin T.pii 








85 




90 






95 


Pro Ala 


Glu Lys 


Glu 


Val Gly Pro Pro 


Leu 


Pro 


Gin 


Glu Ala Val Pro 






100 




105 








110 


Leu Gin 


Lys 


Glu 


Leu 


Pro Ser Leu Gin 


His 


Pro 


Asn 


Glu Gin Lvs Glu 




115 






120 








125 


Gly Thr 


Pro 


Ala 


Pro 


Phe Gly Asp Gin 


Ser 


His 


Pro 


Glu Pro Glu Ser 


130 








135 






140 




Trp Asn Ala Ala 


Gin 


His Cys Gin Gin 


Asp Arg 


Ser 


Gin Glv Glv Trn 


145 








150 




155 




160 


Gly His 


Arg 


Leu 


Asp Gly Phe Pro Pro 


Gly Arg 


Pro 


Ser Pro Asd Asn 








165 




170 






175 


Leu Asn 


Gin 


He 


Cys 


Leu Pro Asn Arg 


Gin 


His 


Val 


Val Tvr Glv Pro 






180 




185 








190 


Trp Asn 


Leu 


Pro 


Gin 


Ser Ser Tyr Ser 


His 


Leu 


Thr 


Ara Gin Glv Glu 




195 






200 








205 


Thr Leu 


Asn 


Phe 


Leu Glu He Gly Tyr 


Ser Arg 


Cys 


Cvs His Cvs Ara 


210 








215 






220 


Ser His 


Thr 


Asn 


Arg 


Leu Glu Cys Ala 


Lys 


Leu 


Val 


Tro Glu Glu Ala 


225 








230 




235 




240 


Met Ser Arg Phe 


Cys 


Glu Ala Glu Phe 


Ser 


Val 


Lys 


Thr Ara Pro His 








245 




250 






255 


Trp Cys 


Cys 


Thr 


Arg Gin Gly Glu Ala 


Arg 


Phe 


Ser 


Cys Phe Gin Glu 






260 




265 








270 


Glu Ala 


Pro 


Gin 


Pro 


His Tyr Gin Leu 


Arg Ala 


Cys 


Pro Ser His Gin 




275 






280 








285 


Pro Asp 


He 


Ser 


Ser 


Gly Leu Glu Leu 


Pro 


Phe 


Pro 


Pro Gly Val Pro 


290 








295 






300 


Thr Leu Asp Asn 


He 


Lys Asn He Cys 


His 


Leu 


Arg Arg Phe Arg Ser 


305 








310 




315 




320 


Val Pro Arg Asn 


Leu 


Pro Ala Thr Asp 


Pro 


Leu 


Gin 


Arg Glu Leu Leu 








325 




330 






335 


Ala Leu 


He 


Gin 


Leu Glu Arg Glu Phe 


Gin Arg 


Cys 


Cys Arg Gin Gly 






340 




345 








350 


Asn Asn 


His 


Thr 


Cys 


Thr Trp Lys Ala 


Trp Glu 


Asp 


Thr Leu Asp Lys 




355 






360 








365 


Tyr Cys 


Asp Arg 


Glu Tyr Ala Val Lys 


Thr 


His 


His 


His Leu Cys Cys 


370 








375 






380 


Arg His 


Pro 


Pro 


Ser 


Pro Thr Arg Asp 


Glu 


Cys 


Phe 


Ala Arg Arg Ala 


385 








390 




395 




400 


Pro Tyr 


Pro 


Asn 


Tyr Asp Arg Asp He 


Leu 


Thr 


He 


Asp He Ser Arg 








405 




410 






415 


Val Thr 


Pro 


Asn 


Leu 


Met Gly His Leu 


Cys 


Gly 


Asn 


Gin Arg Val Leu 






420 




425 








430 


Thr Lys 


His 


Lys 


His 


He Pro Gly Leu 


He 


His 


Asn 


Met Thr Ala Arg 




435 






440 








445 


Cys Cys 


Asp 


Leu 


Pro 


Phe Pro Glu Gin 


Ala 


Cys 


Cys 


Ala Glu Glu Glu 


450 








455 






460 




Lys Leu 


Thr 


Phe 


He 


Asn Asp Leu Cys 


Gly Pro 


Arg Arg Asn He Trp 


465 








470 




475 




480 


Arg Asp 


Pro 


Ala 


Leu 


Cys Cys Tyr Leu 


Ser 


Pro 


Gly Asp Glu Gin Val 








485 




4 90 






495 
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Asn Cys 


Phe 


Asn 


He 






500 




Asp Thr 


Glu 


Asn 


Ala 




515 






Thr Asn 


He 


Ser 


Ser 


530 








<210> 97 






<211> 462 






<212> PRT 






<213> Homo sapiens 


<400> 97 






Met Gly 


Lys 


Glu 


Lys 


1 






5 


Asp Ser 


Gly 


Lys 


Ser 






20 




Gly lie 


Asp 


Lys 


Arg 




35 






Met Gly 


Lys 


Gly 


Ser 


50 








Ala Glu 


Arg 


Glu 


Arg 


65 








Glu Thr 


Ser 


Lys 


Tyr 








85 


Asp Phe 


He 


Lys 


Asn 






100 




Val Leu 


He 


Val 


Ala 




115 






Lys Asn 


Gly 


Gin 


Thr 


130 








Val Lys 


Gin 


Leu 


He 

• 


145 








Pro Tyr 


Ser 


Gin 


Lys 








165 


Tyr He 


Lys 


T ■ • « 

Lys 


He 






180 




He Ser 


Gly 


Trp 


Asn 




195 






Pro Trp 


Phe 


Lys 


Gly 


210 








Gly Thr 


Thr 


Leu 


Leu 


225 








Pro Thr 


Asp 


Lys 


Pro 








245 


Gly Gly 


He 


Gly 


Thr 






260 




Lys Pro 


Gly 


Met 


Val 




275 






Val Lys 


Ser 


Val 


Glu 


290 








Gly Asp 


Asn 


Val 


Gly 


305 








Arg Arg 


Gly 


Asn 


Val 








325 


Ala Ala 


Gly 


Phe 


Thr 



340 



Asn Tyr Leu Arg Asn Val 

505 

Lys Gly Gin Gly Glu Gin 
520 

Thr Ser Glu Pro Lys Glu 
535 



Thr 


His 


He 


Asn He 


Val 








10 




Thr 


Thr 


Thr 


Gly His 


Leu 








25 




Thr 


He 


Glu 


Lys Phe 


Glu 






40 






Phe 


Lys 


Tyr 


Ala Trp 


Val 




55 








Gly 


He 


Thr 


He Asp He 


70 








75 


Tyr 


Val 


Thr 


He He Asp 








90 




Met 


He 


Thr 


Gly Thr 


Ser 








105 




Ala 


Gly Val 


Gly Glu 


Phe 






120 






Aro 


Glu 


His 


Ala Leu 


Leu 




135 








Val 


Gly Val 


Asn Lys 


Met 


150 








155 


Arg 


Tyr 


Glu 


Glu He 


Val 








170 




Gly 


Tyr Asn 


Pro Asp 


Thr 








185 




Gly 


Asp Asn 


Met Leu 


Glu 






200 






Trp 


Lys 


Val 


Thr Arg 


Lys 




215 








Glu 


Ala 


Val 


Asp Cys 


He 


230 








235 


Leu 


Arg 


Leu 


Pro Leu 


Gin 








250 




Val 


Pro 


Val 


Gly Arg Val 








265 




Val 


Thr 


Phe 


Ala Pro 


Val 






280 






Met 


His 


His 


Glu Ala 


Leu 




295 








Phe 


Asn 


Val 


Lys Asn 


Val 


310 








315 


Ala 


Gly Asp 


Ser Lys Asn 








330 




Ala 


Gin 


Val 


He He 


Leu 








345 





Ala Leu Val Ser Gly 
510 

Gly Ser Thr Gly Gly 
525 

Glu 
540 



Val He Gly His Val 

15 

He Tyr Lys Cys Gly 
30 

Lys Glu Ala Ala Glu 
45 

Leu Asp Lys Leu Lys 
60 

Ser Leu Trp Lys Phe 

80 

Ala Pro Gly His Arg 

95 

Gin Ala Asp Cys Ala 
110 

Glu Ala Gly He Ser 
125 

Ala Tyr Thr Leu Gly 
140 

Asp Ser Thr Glu Pro 

160 

Lys Glu Val Ser Thr 

175 

Val Ala Phe Val Pro 
190 

Pro Ser Ala Asn Met 
205 

Asp Gly Asn Ala Ser 
220 

Leu Pro Pro Thr Arg 

240 

Asp Val Tyr Lys He 

255 

Glu Thr Gly Val Leu 
270 

Asn Val Thr Thr Glu 
285 

Ser Glu Ala Leu Pro 
300 

Ser Val Lys Asp Val 

320 

Asp Pro Pro Met Glu 

335 

Asn His Pro Gly Gin 
350 
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He Ser Ala Gly Tyr Ala Pro Val Leu Asp Cys His Thr Ala His He 

355 360 365 

Ala Cys Lys Phe Ala Glu Leu Lys Glu Lys He Asp Arg Arg Ser Gly 

370 375 380 

Lys Lys Leu Glu Asp Gly Pro Lys Phe Leu Lys Ser Gly Asp Ala Ala 
385 390 395 400 

He Val Asp Met Val Pro Gly Lys Pro Met Cys Val Glu Ser Phe Ser 

405 410 415 

Asp Tyr Pro Pro Leu Gly Arg Phe Ala Val Arg Asp Met Arg Gin Thr 

420 425 430 

Val Ala Val Gly Val He Lys Ala Val Asp Lys Lys Ala Ala Gly Ala 

435 440 445 

Gly Lys Val Thr Lys Ser Ala Gin Lys Ala Gin Lys Ala Lys 
450 455 460 

<210> 98 
<211> 2328 
<212> PRT 

<213> Homo sapiens 
<400> 98 

Lys Ser Lys Arg Gin Ala Gin Gin Met Val Gin Pro Gin Ser Pro Val 

1 5 10 15 

Ala Val Ser Gin Ser Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr 

20 25 30 

Gin He Asn Gin Gin Trp Glu Arg Thr Tyr Leu Gly Asn Val Leu Val 

35 40 45 

Cys Thr Cys Tyr Gly Gly Ser Arg Gly Phe Asn Cys Glu Ser Lys Pro 

50 55 60 

Glu Ala Glu Glu Thr Cys Phe Asp Lys Tyr Thr Gly Asn Thr Tyr Arg 
65 70 75 80 

Val Gly Asp Thr Tyr Glu Arg Pro Lys Asp Ser Met He Trp Asp Cys 

85 90 95 

Thr Cys He Gly Ala Gly Arg Gly Arg He Ser Cys Thr He Ala Asn 

100 105 110 

Arg Cys His Glu Gly Gly Gin Ser Tyr Lys He Gly Asp Thr Trp Arg 

115 120 125 

Arg Pro His Glu Thr Gly Gly Tyr Met Leu Glu Cys Val Cys Leu Gly 

130 135 140 

Asn Gly Lys Gly Glu Trp Thr Cys Lys Pro He Ala Glu Lys Cys Phe 
145 150 155 160 

Asp His Ala Ala Gly Thr Ser Tyr Val Val Gly Glu Thr Trp Glu Lys 

165 170 175 

Pro Tyr Gin Gly Trp Met Met Val Asp Cys Thr Cys Leu Gly Glu Gly 

180 185 190 

Ser Gly Arg He Thr Cys Thr Ser Arg Asn Arg Cys Asn Asp Gin Asp 

195 200 205 

Thr Arg Thr Ser Tyr Arg He Gly Asp Thr Trp Ser Lys Lys Asp Asn 

210 215 220 

Arg Gly Asn Leu Leu Gin Cys He Cys Thr Gly Asn Gly Arg Gly Glu 
225 230 235 240 

Trp Lys Cys Glu Arg His Thr Ser Val Gin Thr Thr Ser Ser Gly Ser 

245 250 255 

Gly Pro Phe Thr Asp Val Arg Ala Ala Val Tyr Gin Pro Gin Pro His 

260 265 270 

Pro Gin Pro Pro Pro Tyr Gly His Cys Val Thr Asp Ser Gly Val Val 
275 280 285 
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Tyr Ser Val 

290 
Leu Cys Thr 
305 

Thr Gin Thr 

Phe Thr Tyr 

Gin Asp Gly 
355 

Gin Lys Tyr 

370 
Gly Gly Asn 
385 

Asn His Asn 

Lys Trp Cys 

Phe Cys Pro 
435 

Val Met Tyr 

450 
His Met Met 
465 

Cys lie Ala 

Thr Tyr Asn 

Met Leu Asn 
515 

Asp Pro Val 

530 
lie Gly Asp 
545 

Tyr Cys Tyr 

Thr Tyr Pro 

Pro Ser Gin 
595 

Ser His He 

610 
Gly Arg Trp 
625 

He Lys Gly 

He Gin Gin 

Thr Ser Thr 
675 

Thr Pro Phe 

690 
Thr Ala Ser 
705 

Ser Gly Phe 
Gin Tyr Leu 



Gly Met Gin 

Cys Leu Gly 
310 

Tyr Gly Gly 

325 
Asn Gly Arg 
340 

His Leu Trp 

Ser Phe Cys 

Ser Asn Gly 
390 

Tyr Thr Asp 
405 

Gly Thr Thr 
420 

Met Ala Ala 

Arg He Gly 

Arg Cys Thr 
470 

Tyr Ser Gin 
485 

Val Asn Asp 
500 

Cys Thr Cys 

Asp Gin Cys 

Ser Trp Glu 
550 

Gly Arg Gly 

565 
Ser Ser Ser 
580 

Pro Asn Ser 

Ser Lys Tyr 

Lys Glu Ala 
630 

Leu Lys Pro 

645 
Tyr Gly His 
660 

Ser Thr Pro 

Ser Pro Leu 

Ser Phe Val 
710 

Arg Val Glu 

725 
Asp Leu Pro 



Trp Leu Lys 
295 

Asn Gly Val 

Asn Leu Asn 

Thr Phe Tyr 
345 

Cys Ser Thr 

360 
Thr Asp His 
375 

Ala Leu Cys 

Cys Thr Ser 

Gin Asn Tyr 
425 

His Glu Glu 

440 
Asp Gin Trp 
455 

Cys Val Gly 

Leu Arg Asp 

Thr Phe His 
505 

Phe Gly Gin 

520 
Gin Asp Ser 
535 

Lys Tyr Val 

He Gly Glu 

Gly Pro Val 
585 

His Pro He 

600 
He Leu Arg 
615 

Thr He Pro 

Gly Val Val 

Gin Glu Val 
665 

Val Thr Ser 

680 
Val Ala Thr 
695 

Val Ser Trp 
Tyr Glu Leu 
Ser Thr Ala 



Thr Gin Gly Asn 
300 

Ser Cys Gin Glu 
315 

Gly Glu Pro Cys 
330 

Ser Cys Thr Thr 



Thr Ser Asn 

Thr Val Leu 
380 

His Phe Pro 
395 

Glu Gly Arg 
410 

Asp Ala Asp 

He Cys Thr 

Asp Lys Gin 
460 

Asn Gly Arg 
475 

Gin Cys He 
490 

Lys Arg His 



Tyr 
365 
Val 

Phe 

Arg 

Gin 

Thr 
445 
His 

Gly 

Val 

Glu 



Gly Arg Gly Arg 

525 

Glu Thr Gly Thr 
540 

His Gly Val Arg 
555 

Trp His Cys Gin 
570 

Glu Val Phe He 

Gin Trp Asn Ala 

605 

Trp Arg Pro Lys 
620 

Gly His Leu Asn 
635 

Tyr Glu Gly Gin 
650 

Thr Arg Phe Asp 

Asn Thr Val Thr 

685 

Ser Glu Ser Val 
700 

Val Ser Ala Ser 
715 

Ser Glu Glu Gly 
730 

Thr Ser Val Asn 



Lys Gin Met 

Thr Ala Val 
320 

Val Leu Pro 

335 
Glu Gly Arg 
350 

Glu Gin Asp 

Gin Thr Gin 

Leu Tyr Asn 
400 

Asp Asn Met 

415 
Lys Phe Gly 
430 

Asn Glu Gly 

Asp Met Gly 

Glu Trp Thr 
480 

Asp Asp He 

4 95 
Glu Gly His 
510 

Trp Lys Cys 

Phe Tyr Gin 

Tyr Gin Cys 
560 

Pro Leu Gin 

575 
Thr Glu Thr 
590 

Pro Gin Pro 

Asn Ser Val 

Ser Tyr Thr 
640 

Leu He Ser 

655 
Phe Thr Thr 
670 

Gly Glu Thr 

Thr Glu He 

Asp Thr Val 
720 

Asp Glu Pro 

735 
He Pro Asp 
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740 










745 








750 






Leu Leu Pro Gly Arg Lys 


Tyr 


He 


Val 


Asn 


Val 


Tyr Gin 


He 

^ JL- V** 


Ser 


Glu 




755 








760 








765 








Asp Gly Glu Gin Ser Leu 


He 


Leu 


Ser 


Thr 


Ser 


Gin Thr 


Thr 


Ala 


Pro 


770 








775 










780 








Asp Ala 


Pro Pro 


Asp 


Pro 


Thr 


Val 


Asp 


Gin 


Val 


Asd Asd 


Thr 


Ser 

KJ w A 


lie 

a. x c 


785 






790 










795 






800 

V/ V V/ 


Val Val Arg Trp Ser Arg 


Pro 


Gin 


Ala 


Pro 


He 


Thr Glv 


Tvr 


Arn 


He 

X X c 






805 










810 








815 




Val Tyr 


Ser Pro 


Ser 


Val 


Glu 


Gly 


Ser 


Ser 


Thr 


Glu Leu 


Asn 


Leu 


Pro 




820 










825 








830 






Glu Thr 


Ala Asn 


Ser 


Val 


Thr 


Leu 


Ser 


Asp 


Leu 


Gin Pro 


Gly 

Jr 


Val 


Gin 




835 








840 








• 845 






Tyr Asn 


He Thr 


He 


Tyr 


Ala 


Val 


Glu 


Glu 


Asn 


Gin Glu 


Ser 

mm S» 


Thr 


Pro 


850 








855 










860 








Val Val 


He Gin 


Gin 


Glu 


Thr Thr Gly Thr 


Pro Arg Ser 


Asd 


Thr 


Val 

V CI X 


865 






870 










875 






880 

W V V 


Pro Ser 


Pro Arg Asp Leu 


Gin 


Phe 


Val 


Glu 


Val 


Thr Asp 


Val 


Lvs 


Val 

* a x 






885 










890 








895 




Thr He 


Met Trp 


Thr 


Pro 


Pro 


Glu 


Ser 


Ala 


Val 


Thr Gly 


Tvr 


Ara 


Val 




900 










905 






910 




Asp Val 


He Pro 


Val 


Asn 


Leu 


Pro Gly Glu His Gly Gin 


Arg 


Leu 


Pro 




915 








920 








925 






lie Ser 


Arg Asn 


Thr 


Phe 


Ala 


Glu 


Val 


Thr Gly Leu Ser 


Pro 


Glv 


Val 


930 








935 










940 






Thr Tyr Tyr Phe 


Lys 


Val 


Phe 


Ala 


Val 


Ser 


His' 


Gly Arg 


Glu 


Ser 


Lvs 


945 






950 










955 








960 


Pro Leu 


Thr Ala 


Gin 


Gin 


Thr 


Thr 


Lys 


Leu Asp Ala Pro 


Thr 


Asn 


Leu 






965 










970 








975 




Gin Phe 


Val Asn 


Glu 


Thr 


Asp 


Ser 


Thr 


Val 


Leu 


Val Arg 


Trp 


Thr 


Pro 




980 










985 








990 






Pro Arg Ala Gin 


He 


Thr 


Gly 


Tyr Arg 


Leu 


Thr 


Val Gly 


Leu 


Thr 


Arg 




995 








1000 






1005 




Arg Gly Gin Pro Arg Gin 


Tyr 


Asn 


Val 


Gly 


Pro 


Ser Val 


Ser 


Lys 


Tyr 


1010 






1015 








1020 






Pro Leu Arg Asn 


Leu 


Gin 


Pro 


Ala 


Ser 


Glu 


Tyr 


Thr Val 


Ser 


Leu 


Val 


1025 






1030 








1035 






1040 


Ala He Lys Gly Asn Gin Glu 


Ser 


Pro 


Lys 


Ala 


Thr Gly Val 


Phe 


Thr 






1045 








1050 






1055 


Thr Leu 


Gin Pro 


Gly Ser 


Ser 


He 


Pro 


Pro 


Tyr 


Asn Thr 


Glu 


Val 


Thr 



1060 1065 1070 



Glu Thr Thr He Val He Thr Trp Thr Pro Ala Pro Arg He Gly Phe 

1075 1080 1085 

Lys Leu Gly Val Arg Pro Ser Gin Gly Gly Glu Ala Pro Arg Glu Val 

1090 1095 1100 

Thr Ser Asp Ser Gly Ser He Val Val Ser Gly Leu Thr Pro Gly Val 
1105 1110 1115 1120 

Glu Tyr Val Tyr Thr He Gin Val Leu Arg Asp Gly Gin Glu Arg Asp 

* 1125 1130 1135 

Ala Pro He Val Asn Lys Val Val Thr Pro Leu Ser Pro Pro Thr Asn 

1140 1145 1150 

Leu His Leu Glu Ala Asn Pro Asp Thr Gly Val Leu Thr Val Ser Trp 

1155 1160 1165 

Glu Arg Ser Thr Thr Pro Asp He Thr Gly Tyr Arg He Thr Thr Thr 

1170 1175 1180 

Pro Thr Asn Gly Gin Gin Gly Asn Ser Leu Glu Glu Val Val His Ala 
1185 1190 1195 1200 
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Asp Gin Ser Ser Cys Thr Phe Asp Asn Leu Ser Pro Gly Leu Glu Tyr 

1205 1210 1215 

Asn Val Ser Val Tyr Thr Val Lys Asp Asp Lys Glu Ser Val Pro He 

1220 1225 1230 

Ser Asp Thr He He Pro Ala Val Pro Pro Pro Thr Asp Leu Arg Phe 

1235 1240 1245 

Thr Asn He Gly Pro Asp Thr Met Arg Val Thr Trp Ala Pro Pro Pro 

1250 1255 1260 

Ser He Asp Leu Thr Asn Phe Leu Val Arg Tyr Ser Pro Val Lys Asn 
1265 1270 1275 1280 

Glu Glu Asp Val Ala Glu Leu Ser He Ser Pro Ser Asp Asn Ala Val 

1285 1290 1295 

Val Leu Thr Asn Leu Leu Pro Gly Thr Glu Tyr Val Val Ser Val Ser 

1300 1305 1310 

Ser Val Tyr Glu Gin His Glu Ser Thr Pro Leu Arg Gly Arg Gin Lys 

1315 1320 1325 

Thr Gly Leu Asp Ser Pro Thr Gly He Asp Phe Ser Asp He Thr Ala 

1330 1335 1340 

Asn Ser Phe Thr Val His Trp He Ala Pro Arg Ala Thr He Thr Gly 
1345 1350 1355 1360 

Tyr Arg He Arg His His Pro Glu His Phe Ser Gly Arg Pro Arg Glu 

1365 1370 1375 

Asp Arg Val Pro His Ser Arg Asn Ser He Thr Leu Thr Asn Leu Thr 

1380 1385 1390 

Pro Gly Thr Glu Tyr Val Val Ser He Val Ala Leu Asn Gly Arg Glu 

1395 1400 1405 

Glu Ser Pro Leu Leu He Gly Gin Gin Ser Thr Val Ser Asp Val Pro 

1410 1415 1420 

Arg Asp Leu Glu Val Val Ala Ala Thr Pro Thr Ser Leu Leu He Ser 
1425 1430 1435 1440 

Trp Asp Ala Pro Ala Val Thr Val Arg Tyr Tyr Arg He Thr Tyr Gly 

1445 1450 1455 

Glu Thr Gly Gly Asn Ser Pro Val Gin Glu Phe Thr Val Pro Gly Ser 

1460 1465 1470 

Lys Ser Thr Ala Thr He Ser Gly Leu Lys Pro Gly Val Asp Tyr Thr 

1475 1480 1485 

He Thr Val Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser 

1490 ' 1495 1500 

Lys Pro He Ser He Asn Tyr Arg Thr Glu He Asp Lys Pro Ser Gin 
1505 1510 1515 1520 

Met Gin Val Thr Asp Val Gin Asp Asn Ser He Ser Val Lys Trp Leu 

1525 1530 1535 

Pro Ser Ser Ser Pro Val Thr Gly Tyr Arg Val Thr Thr Thr Pro Lys 

1540 1545 1550 

Asn Gly Pro Gly Pro Thr Lys Thr Lys Thr Ala Gly Pro Asp Gin Thr 

1555 1560 1565 

Glu Met Thr He Glu Gly Leu Gin Pro Thr Val Glu Tyr Val Val Ser 

1570 1575 1580 

Val Tyr Ala Gin Asn Pro Ser Gly Glu Ser Gin Pro Leu Val Gin Thr 
1585 1590 1595 1600 

Ala Val Thr Asn He Asp Arg Pro Lys Gly Leu Ala Phe Thr Asp Val 

1605 1610 1615 

Asp Val Asp Ser He Lys He Ala Trp Glu Ser Pro Gin Gly Gin Val 

1620 1625 1630 

Ser Arg Tyr Arg Val Thr Tyr Ser Ser Pro Glu Asp Gly He His Glu 

1635 1640 1645 

Leu Phe Pro Ala Pro Asp Gly Glu Glu Asp Thr Ala Glu Leu Gin Gly. 
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1650 1655 1660 

Leu Arg Pro Gly Ser Glu Tyr Thr Val Ser Val Val Ala Leu His Asd 
1665 1670 1675 16 q 0 

Asp Met Glu Ser Gin Pro Leu He Gly Thr Gin Ser Thr Ala He Pro 

1685 1690 1695 

Ala Pro Thr Asp Leu Lys Phe Thr Gin Val Thr Pro Thr Ser Leu Ser 

1700 1705 1710 

Ala Gin Trp Thr Pro Pro Asn Val Gin Leu Thr Gly Tyr Arg Val Arg 

1715 1720 1725 

Val Thr Pro Lys Glu Lys Thr Gly Pro Met Lys Glu He Asn Leu Ala 

1730 1735 1740 

Pro Asp Ser Ser Ser Val Val Val Ser Gly Leu Met Val Ala Thr Lys 
1745 1750 1755 1760 

Tyr Glu Val Ser Val Tyr Ala Leu Lys Asp Thr Leu Thr Ser Arg Pro 

1765 1770 1775 

Ala Gin Gly Val Val Thr Thr Leu Glu Asn Val Ser Pro Pro Arg Arg 

1780 1785 1790 

Ala Arg Val Thr Asp Ala Thr Glu Thr Thr He Thr He Ser Trp Arg 

1795 1800 1805 

Thr Lys Thr Glu Thr He Thr Gly Phe Gin Val Asp Ala Val Pro Ala 

1810 1815 1820 

Asn Gly Gin Thr Pro He Gin Arg Thr He Lys Pro Asp Val Arg Ser 
1825 1830 1835 1840 

Tyr Thr He Thr Gly Leu Gin Pro Gly Thr Asp Tyr Lys He Tyr Leu 

1845 1850 1855 

Tyr Thr Leu Asn Asp Asn Ala Arg Ser Ser Pro Val Val He Asp Ala 

. I860 1865 1870 

Ser Thr Ala He Asp Ala Pro Ser Asn Leu Arg Phe Leu Ala Thr Thr 

1875 1880 1885 

Pro Asn Ser Leu Leu Val Ser Trp Gin Pro Pro Arg Ala Arg He Thr 

1890 1895 1900 

Gly Tyr He He Lys Tyr Glu Lys Pro Gly Ser Pro Pro Arg Glu Val 
1905 1910 1915 1920 

Val Pro Arg Pro Arg Pro Gly Val Thr Glu Ala Thr He Thr Gly Leu 

1925 1930 1935 

Glu Pro Gly Thr Glu Tyr Thr He Tyr Val He Ala Leu Lys Asn Asn 

1940 1945 1950 

Gin Lys Ser Glu Pro Leu He Gly Arg Lys Lys Thr Asp Glu Leu Pro 

1955 I960 1965 

Gin Leu Val Thr Leu Pro His Pro Asn Leu His Gly Pro Glu He Leu 

1970 1975 1980 

Asp Val Pro Ser Thr Val Gin Lys Thr Pro Phe Val Thr His Pro Gly 
1985 1990 1995 2000 

Tyr Asp Thr Gly Asn Gly He Gin Leu Pro Gly Thr Ser Gly Gin Gin 

2005 2010 2015 

Pro Ser Val Gly Gin Gin Met He Phe Glu Glu His Gly Phe Arg Arg 

2020 2025 2030 

Thr Thr Pro Pro Thr Thr Ala Thr Pro He Arg His Arg Pro Arg Pro 

2035 2040 2045 

Tyr Pro Pro Asn Val Gly Gin Glu Ala Leu Ser Gin Thr Thr He Ser 

2050 2055 2060 

Trp Ala Pro Phe Gin Asp Thr Ser Glu Tyr He He Ser Cys His Pro 
2065 2070 2075 2080 

Val Gly Thr Asp Glu Glu Pro Leu Gin Phe Arg Val Pro Gly Thr Ser 

2085 2090 2095 

Thr Ser Ala Thr Leu Thr Gly Leu Thr Arg Gly Ala Thr Tyr Asn He 

2100 2105 2110 
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lie Val Glu Ala Leu Lys. Asp Gin Gin Arg His Lys Val Arg Glu Glu 

2H5 2120 2125 

Val Val Thr Val Gly Asn Ser Val Asn Glu Gly Leu Asn Gin Pro Thr 

2130 2135 2140 

Asp Asp Ser Cys Phe Asp Pro Tyr Thr Val Ser His Tyr Ala Val Gly 
2145 2150 2155 2160 

Asp Glu Trp Glu Arg Met Ser Glu Ser Gly Phe Lys Leu Leu Cys Gin 

2165 2170 2175 

Cys Leu Gly Phe Gly Ser Gly His Phe Arg Cys Asp Ser Ser Arg Trp 

2180 2185 2190 

Cys His Asp Asn Gly Val Asn Tyr Lys He Gly Glu Lys Trp Asp Arg 

2195 2200 2205 

Gin Gly Glu Asn Gly Gin Met Met Ser Cys Thr Cys Leu Gly Asn Gly 

2210 2215 2220 

Lys Gly Glu Phe Lys Cys Asp Pro His Glu Ala Thr Cys Tyr Asp Asp 
2225 2230 2235 2240 

Gly Lys Thr Tyr His Val Gly Glu Gin Trp Gin Lys Glu Tyr Leu Gly 

2245 2250 2255 

Ala He Cys Ser Cys Thr Cys Phe Gly Gly Gin Arg Gly Trp Arg Cys 

2260 2265 2270 

Asp Asn Cys Arg Arg Pro Gly Gly Glu Pro Ser Pro Glu Gly Thr Thr 

2275 2280 2285 

Gly Gin Ser Tyr Asn Gin Tyr Ser Gin Arg. Tyr His Gin Arg Thr Asn 

2290 2295 2300 

Thr Asn Val Asn Cys Pro He Glu Cys Phe Met Pro Leu Asp Val Gin 
2305 2310 2315 2320 

Ala Asp Arg Glu Asp Ser Arg Glu 

2325 



<210> 99 




























<211> 188 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 99 


























» 


His Gin 


Thr 


His 


Lys 


Glu 


Gly 


Gly Ser 


Thr 


His 


Ala 


Ser 


Ala 


Asp 


Ala 


1 






5 










10 










15 




Trp Glu 


He 


He 


Glu 


Leu 


Glu 


Thr 


Glu 


He 


Glu 


Lys 


Phe 


Lys 


Ala 


Glu 






20 










25 








30 






Asn Ala 


Ser 


Leu 


Ala 


Lys 


Leu 


Arg 


He Glu Arg 


Glu 


Ser 


Ala 


Leu 


Glu 




35 










40 




• 






45 








Lys Leu 


Arg 


Lys 


Glu 


He 


Ala 


Asp 


Phe 


Glu 


Gin 


Gin 


Lys 


Ala 


Lys 


Glu 


50 










55 










60 








Leu Ala 


Arg 


He 


Glu 


Glu 


Phe 


Lys 


Lys 


Glu 


Glu 


Met Arg Lys 


Leu 


Gin 


65 








70 










75 










80 


Lys Glu 


Arg 


Lys 


Val 


Phe 


Glu 


Lys 


Tyr 


Thr 


Thr 


Ala 


Ala 


Arg 


Thr 


Phe 








85 










90 








95 




Pro Asp 


Lys 


Lys 


Glu 


Arg 


Glu 


Glu 


He 


Gin 


Thr 


Leu 


Lys 


Gin 


Gin 


He 






100 










105 










110 






Ala Asp 


Leu 


Arg 


Glu 


Asp 


Leu 


Lys 


Arg 


Lys 


Glu 


Thr 


Lys 


Trp 


Ser 


Ser 




115 










120 










125 








Thr His 


Ser 


Arg 


Leu 


Arg 


Ser 


Gin 


He 


Gin 


Met 


Leu 


Val 


Arg 


Glu 


Asn 


130 










135 










140 








Thr Asp 


Leu 


Arg 


Glu 


Glu 


He 


Lys 


Val 


Met 


Glu 


Arg 


Phe 


Arg 


Leu 


Asp 


145 








150 










155 








160 


Ala Trp 


Lys 


Arg 


Ala 


Glu 


Ala 


He 


Glu 


Ser 


Ser 


Leu 


Glu 


Val 


Glu 


Lys 








165 










170 










175 
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Lys Asp Lys Leu Ala Asn Thr Ser Val Arg Phe Gin 

180 185 

<210> 100 
<211> 284 
<212> PRT 
<213> Homo sapiens 

<400> 100 

Met Glu Pro Gly Asn Tyr Ala Thr Leu Asp Gly Ala Lys Asp He Glu 

1 ' 5 10 15 

Gly Leu Leu Gly Ala Gly Gly Gly Arg Asn Leu Val Ala His Ser Pro 

20 25 30 

Leu Thr Ser His Pro Ala Ala Pro Thr Leu Met Pro Ala Val Asn Tyr 

35 . 40 45 

Ala Pro Leu Asp Leu Pro Gly Ser Ala Glu Pro Pro Lys Gin Cys His 

50 55 60 

Pro Cys Pro Gly Val Pro Gin Gly Thr Ser Pro Ala Pro Val Pro Tyr 
65 70 75 80 

Gly Tyr Phe Gly Gly Gly Tyr Tyr Ser Cys Arg Val Ser Arg Ser Ser 

85 90 95 

Leu Lys Pro Cys Ala Gin Ala Ala Thr Leu Ala Ala Tyr Pro Ala Glu 

100 105 110 

Thr Pro Thr Ala Gly Glu Glu Tyr Pro Ser Arg Pro Thr Glu Phe Ala 

115 120 125 

Phe Tyr Pro Gly Tyr Pro Gly Thr Tyr His Ala Met Ala Ser Tyr Leu 

130 135 140 

Asp Val Ser Val Val Gin Thr Leu Gly Ala Pro Gly Glu Pro Arg His 
145 150 155 160 

Asp Ser Leu Leu Pro Val Asp Ser Tyr Gin Ser Trp Ala Leu Ala Gly 

165 170 175 

Gly Trp Asn Ser Gin Met Cys Cys Gin Gly Glu Gin Asn Pro Pro Gly 

180 185 190 

Pro Phe Trp Lys Ala Ala Phe Ala Asp Ser Ser Gly Gin His Pro Pro 

195 200 205 

Asp Ala Cys Ala Phe Arg Arg Gly Arg Lys Lys Arg He Pro Tyr Ser 

210 215 220 

Lys Gly Gin Leu Arg Glu Leu Glu Arg Glu Tyr Ala Ala Asn Lys Phe 
225 230 235 240 

He Thr Lys Asp Lys Arg Arg Lys He Ser Ala Ala Thr Ser Leu Ser 

245 250 255 

Glu Arg Gin He Thr lie Trp Phe Gin Asn Arg Arg Val Lys Glu Lys 

260 265 270 

Lys Val Leu Ala Lys Val Lys Asn Ser Ala Thr Pro 
275 280 

<210> 101 
<211> 676 
<212> PRT 
<213> Homo sapiens 

<400> 101 

Met Asp Lys Tyr Asp Asp Leu Gly Leu Glu Ala Ser Lys Phe He Glu 

1 5 10 15 

Asp Leu Asn Met Tyr Glu Ala Ser Lys Asp Gly Leu Phe Arg Val Asp 

20 25 30 

Lys Gly Ala Gly Asn Asn Pro Glu Phe Glu Glu Thr Arg Arg Val Phe 
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Ala Thr 

50 
Leu Gin 
65 

Gly Gly 

Leu Thr 

Ala Pro 

Pro Gin 
130 
Ser Gin 
145 

Ala Phe 

Gly Asp 

Lys Pro 

Pro Ser 
210 

His Pro 
225 

Glu Gly 

Glu Lys 

Pro Gly 

Pro- Val 
290 
Cys Pro 
305 

Glu Val 

Phe Gin 

Ser Pro 

Gly Pro 
370 
Pro Thr 
385 

Cys Lys 

Val Leu 

Pro Leu 

Leu Glu 
450 
Lys Ala 
465 

Gly Ala 



35 

Lys Met 

Glu Glu 

Arg Leu 

Val Asp 
100 
Gly Ala 
115 

Glu Gin 

Asp Cys 

His Gin 

Tyr Tyr 
180 
Gly Val 
195 

Ser Pro 

Leu Asn 

Ser Leu 

Pro Thr 
260 
Leu Pro 
275 

Gin Pro 

Arg Gin 

Ser Gly 

Asp Gly 
340 
Ala Gly 
355 

Lys Pro 

Ser Leu 

Glu Gly 

Leu Asp 
420 
Val Pro 
435 

Ala Leu 
Asp Tyr 
Gly Gin 



40 

Ala Lys He His 
55 

Thr Leu Pro Arg 
70 

Gly Pro Gin Ala 
85 

Gly Ala Ala Lys 

Val Thr Thr Leu 

120 

Arg Ser Arg Pro 
135 

Gly Ser Arg Glu 
150 

Pro Gly Pro Cys 
165 

Asp Asn Leu Ser 

Ser Pro Ser He 

200 

Gly Ser Asp Pro 
215 

His Arg Gin Leu 
230 

Gly Gly Gin Asn 
245 

Gly Leu Trp Ser 

Ser Pro Asn Leu 

280 

Arg Thr Pro Ser 
295 

Gly Gly Leu Pro 
310 

Val Met Ser Lys 
325 

Pro Lys Ser Tyr 

Leu Asp Gly Ser 

360 

Gly Cys Thr Asp 
375 

Val His Pro Val 
390 

Pro Leu Gly Trp 
405 

Ser Pro Ser Ser 

Gly Pro Glu Leu 

440 

Thr Gin Arg Leu 
455 

Phe Gly Ala Cys 
470 

Ala Cys Gin Ala 
485 



Leu Gin Gin Gin 

60 

Gly Ser Arg Gly 
75 

Arg Trp Glu Val 
90 

Pro Pro Leu Ala 
105 

Ala Ala Gly Gin 

Tyr Leu His Gly 

140 

Ser Leu Ala Thr 
155 

Glu Asp Pro Ser 
170 

Leu Ala Ser Pro 
185 

Gly Leu Ser Val 

Pro Leu Pro Lys 

220 

Ser Leu Ser Ser 
235 

Ser Gly He Gly 
250 

Thr Ala Ser Ser 
265 

Glu Asn Gly Ala 

Val Ser Ala Pro 

300 

Arg Ser Asn Ser 
315 

Pro Asn Val Asp 
330 

Leu Ser Ser Ser 
345 

Gin Gin Gly Ala 

Leu Gly Thr Gly 

380 

Met Ser Thr Leu 
395 

Ser Ser Asp Gly 
410 

Pro Arg Val Arg 
425 

Arg Pro Ser Ala 

Glu Arg Glu Met 

460 

Val Lys Cys Ser 
475 

Met Gly Asn Leu 
4 90 



45 

Gin Gin Gin Leu 

Pro Val Asn Gly 

80 

Val Gly Ser Lys 
95 

Ala Ser Thr Gly 
110 

Pro Pro Tyr Pro 
125 

Thr Arg His Gly 

Ser Glu Met Ser 

160 

Cys Leu Thr His 
175 

Lys Trp Gly Asp 
190 

Gly Ser Gly Trp 
205 

Pro Cys Gly Asp 

Ser Arg Ser Ser 

240 

Gly Arg Ser Ser 
255 

Gin Arg Val Ser 
270 

Pro Ala Val Gly 
285 

Leu Ala Leu Ser 

Gly Leu Gly Gly 

320 

Pro Gin Pro Trp 
335 

Ala Pro Ser Ser 
350 

Val Pro Gly Leu 
365 

Pro Lys Leu Ser 



Pro Glu Leu 

Ser Leu Gly 
415 

Leu Pro Cys 

430 
Ala Glu Leu 
445 

Asp Ala His 



Ser 
400 
Ser 

Gin 

Lys 

Pro 



Lys Gly Val Phe 

480 

Tyr His Asp Thr 
495 
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Cys Phe 


Thr 


Cys 


Ala 


Ala Cys 


Ser 




500 










Tyr Phe 


Val 
515 


Asn 


Gly 


Lys 


Val 


Phe 
520 


Gly Phe 


Gin 


Gin 


Ser 


Ala 


Asp Arg 


530 










535 




Met Asp 


Met 


lie 


Leu 


Gin 


Ala 


Leu 


545 








550 






Phe Arg 


Cys 


Val 


He 
565 


Cys 


Asn 


Glu 


Val Asp 


Ser 


Glu 
580 


Asn 


Lys 


He 


Tyr 


Leu Ala 


Pro 
595 


Lys 


Cys 


Ala 


Ala 


Cys 
600 


Gly Ser 


Asp 


Glu 


Thr 


He Arg Val 


610 










615 




Val Glu 


Cys 


Tyr 


His 


Cys 


Glu Asp 


625 








630 






Asp Gly 


His 


Arg 


Cys 
645 


Tyr 


Pro 


Leu 


Cys His 


Val 


Lys 
660 


Arg 


Leu 


Glu 


Lys 


Gin His 

• 


His 
675 


Phe 











<210> 102 
<211> 296 
<212> PRT 
<213> Homo sapiens 

<400> 102 



Ser 


Thr 


Gly 


Ser 


Glu 


Phe Pro Leu 


1 








5 




Ala 


Ala 


Arg 


Ala 


Gly 


Gly Arg Ala 








20 






Arg 


Glu 


Thr 


Arg 


Gly 


Asn Pro Gly 






35 






40 


Thr 


Met 


Thr 


Gly 


Ser 


Asn Met Ser 




50 








55 


Gin 


Arg 


Cys 


Gin 


Ala 


Arg Phe Ser 


65 










70 


Asn 


Gly 


Glu 


Leu 


Tyr 


His Glu His 










85 




Arg 


Pro 


Phe 


Pro 


Glu 


Gly Leu Phe 








100 






Cys 


Glu 


His 


Asp 


Phe 


Gin Met Leu 






115 






120 


Gly 


Glu 


Phe 


He 


He 


Gly Arg Val 




130 








135 


His 


Pro 


Gly 


Cys 


Phe 


Arg Cys Glu 


145 










150 


Leu 


Gly 


Phe 


Val 


Lys 


Asn Ala Gly 










165 




Asn 


Arg 


Glu 


Lys 


Ala 


Lys Gly Leu 








180 






His 


Leu 


Val 


He 


Asp 


Glu Gin Pro 






195 






200 



Arg 


Lys 


Leu Arg Gly Lys Ala 


Phe 


505 






510 




Cys 


Glu 


Glu Asp Phe 


Leu Tyr 


Ser 






525 






Cys 


Phe 


Leu Cys Gly 


His Leu 


He 






540 






Gly 


Lys 


Ser Tyr His 


Pro Gly Cys 






555 




560 


Cys 


Leu 


Asp Gly Val 


Pro Phe 


Thr 




570 




575 




Cys 


Val 


Arg Asp Tyr His Lys Val 


585 






590 




Gly 


Leu 


Pro He Leu 


Pro Pro 


Glu 






605 






Val 


Ser 


Met Asp Arg 


Asp Tyr 


His 






620 






Cys 


Gly 


Leu Glu Leu 


Asn Asp 


Glu 






635 




640 


Glu 


Asp 


His Leu Phe 


Cys His 


Ser 




650 




655 




Arg 


Pro 


Ser Ser Thr 


Ala Leu 


His 


665 






670 





Cys 


Thr 


Lys Ala Ser 


Pro Cys 


Ser 




10 




15 




Leu 


Gly 


Trp Arg Leu 


Gin Gin 


Gin 


25 






30 




Asn 


Pro 


Gly Leu Gly Val Ala Ala 






45 






Asp 


Ala 
• ■ 


Leu Ala Asn 


Ala Val 


Cys 




60 






Pro 


Ala 


Glu Arg He 


Val Asn 


Ser 






75 




80 


Cys 


Phe 


Val Cys Ala 


Gin Cys 


Phe 




90 




95 




Tyr 


Glu 


Phe Glu Gly Arg Lys 


Tyr 


105 






110 




Phe 


Ala 


Pro Cys Cys 


Gly Ser Cys 






125 






He 


Lys 


Ala Met Asn 


Asn Asn 


Trp 






140 






Leu 


Cys 


Asp Val Glu 


Leu Ala 


Asp 






155 




160 


Arg 


His 


Leu Cys Arg 


Pro Cys 


His 




170 




175 




Gly 


Lys 


Tyr He Cys 


Gin Arg 


Cys 


185 






190 




Leu 


Met 


Phe Arg Ser Asp Ala 


Tyr 



205 
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i i v 


Asp 


His 


Phe 


Asn 


\*yo lui njLS 


Cys Gly 


Lys 


mi n 

UlU 


Leu 


inr 


Ala 




210 

C JL V 










£, X <«J 






220 










ml 11 


nid 


Arg 


Glu 


Leu Lys 


vjj.y uiu Leu 


Tyr Cys 


Leu 


D V* 

fro 


cys 


HIS 


Asp 


99 ^ 










230 


• 




235 










240 


Lys 


Met 


Gly Val 


Pro 


He 


Cys Gly Ala 


Cys Arg Arg 


Pro 


He 


Glu 


Gly 










245 






250 










255 




Arg 


Val 


Val 


Asn 


Ala 


Leu 


uiy Jjys uin 


Trp 


His 


Val 


GlU 


His 


Phe 


Val 








260 






9 

zoo 










270 






Cys 


Ala 


Lys 


Cys 


Glu 


Lys 


pro trie Leu 


Gly His Arg 


HIS 


Tyr 


Glu 


Lys 






275 








280 








285 








Lys 


Gly 


Leu 


Ala 


Tyr Cys 


Glu Leu 


















290 










"5QC 
















<210> 103 






• 


















<211> 500 
























<212> PRT 
























<213> Homo sapiens 




















<400> 103 
























Met 


Glv 


He 


Gly 


Leu 


Ser 


nxa vjin uiy 


Val 


Asn 


Met 


Asn 


Arg 


Leu 


Pro 


1 








5 






10 










i ^ 

10 




Gly 


Trp 


Asd 


Lys 


His 


Ser 


i yr uiy lyr 


His 


Gly Asp 




Gly 


HIS 


-||-*- 

ser 








'20 






9 ^ 
















Phe 


Cvs 


Ser 


Ser 


Gly Thr 


uiy bin tro 


Tyr Gly 


Pro 


inr 


fne 


Tnr 


rn W m 

Tnr 






35 
























Glv 


Asd 


Val 


He 


Gly Cys 


cys vai Asn 


Leu 


He 


Asn 


Asn 


l nr 


cys 


Pne 




50 
















60 










Tvr 


Thr 


Lvs 


Asn 


Gly His 


oer Leu oiy 


He 


Ala 


Phe 


inr 


ASp 


T ah 

Leu 


Fro 


65 










70 






75 










o u 


Pro 


Asn 


Leu 


Tyr 


Pro 


Thr 


Wa 1 ml w T on 
val Uiy LcU 


Gin 


Thr 


Pro 


mi \t 
uiy 


ulu 


vai 


vai 










85 






90 










QS 




Asp 


Ala 


Asn 


Phe 


Gly Gin 


H i cs Pro Phot 
nxs rro rnc 


Val 


Phe Asp 


T 1 A 

lie 


pi,, 

ulu 


ASp 


i yr 








100 






i n^ 










1 xu 






Met 


Arg 


Glu 


Trp 


Arg 


Thr 


T I/O T 1 Q ml ¥-1 

Lys lie ulu 


Ala 


Gin 


He 


ASp 


Arg 


fne 


fro 






115 








1 £ U 








1 9^ 

1 £. «? 








He 


Gly 


Asp 


Arg 


Glu Gly 


fl ii Tva ml n 

uiu i rp uin 


Thr 


Met 


He 


Tin 

uin 


Lys 


Met 


vai 




130 










i o j 






140 










Ser 


Ser 


Tyr 


Leu 


Val 


His 


nis biy i yr 


Cys 


Ala 


Thr 


Aid 


mi n 


Ala 

Aia 


f ne 


145 










150 






155 










i ^n 
1 ou 


Ala 


Arg 


Ser 


Thr 


Asp 


Gin 


LliL val LcU 


Glu 


Glu 


Leu 


Ala 
Ala 


oer 


lie 


Lys 










165 






170 










175 




Asn 


Arg 


Gin 


Arg 


He 


Gin 


T.v/<5 T.Pii Vfl 1 
u y a ucu v ax 


Leu 


Ala 


Gly 


rvi y 


L Jc I- 


mi m 

uiy 


mi n 

olU 








180 






' 185 

JL w «J 










i Qn 






Ala 


He 


Glu 


Thr 


Thr 


Gin 


oiu bcu i yj. 


Pro 


Ser 


Leu 


T on 


ml n 

blli 


7\ •»>* /"T 

ni y 


Ben 

Asn 






195 
















£. VJ -J 






Pro 


Asn 


Leu 


Leu 


Phe 


Thr 


Jj6U Ljyo Vol 


Arg 


Gin 


Phe 


Tin 
1 16 


m 1 n 




v ai 




210 










9 1 S 
£ 1 J 






220 










Asn 


Gly 


Thr 


Asp 


Ser 


Glu 


vai nig ^ys 


Leu Gly Gly 


& V* /-v 

Arg 


O n v> 


fro 


Lys 


225 










230 






235 












Ser 


Gin 


Asp 


Ser 


Tyr 


Pro 


Val Ser Pro 


Arg 


Pro 


Phe 


Ser 


Ser 


Pro 


Ser 










245 






250 










255 




Met 


Ser 


Pro 


Ser 


His 


Gly 


Met Asn lie 


His 


Asn 


Leu 


Ala 


Ser 


Gly 


Lys 








2 60 






265 










270 






Gly 


Ser 


Thr 


Ala 


His 


Phe 


Ser Gly Phe 


Glu 


Ser 


Cys 


Ser 


Asn 


Gly 


Val 






275 








280 








285 








He 


Ser 


Asn 


Lys 


Ala 


His 


Gin Ser Tyr 


Cys 


His 


Ser 


Asn 


Lys 


His 


Gin 




290 










295 






300 
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Ser 


C A V 






Ti e? i*\ 

nsn 


w a 1 Prr> Pin Toil Acn 
Val t Vjiu bcu fiai* 


Ser 


He 


1 en Mot* Qo*» ft'*-/** 

/ion iit;i_ oer Arg 












0 1 u 


315 




o<cU 


Ser 


Gin 


m n 


Val 


tew 


Ben oho Thr Cor A^n 

Asn fne int oci rvon 


Asp Val 


no^ licL oiu inr 










3 c. 0 










Asp 


His 


Tyr 


Cnr 

oer 


7\ v> 


r i„ t/ai rl w P1 11 Thr* 

biy vai biy ijiu nii 


Ser 


Ser 


Ren P 1 « / Dk n T an 

risii oiy rne lieu 






1 a n 
340 




O«J0 








Asn Gly 


Ser 


Ser 


Lys 


HIS ASp Mis biu net 


Glu Asp 


bys Asp inr biu 






355 






^ en 






J DO 


Met 


Glu 


val 


Asp 


ber 


561 bin Jjeu Axy /\zry 


Gin 


Leu 


cys oiy biy oer 




370 








O / 0 




380 




Gin 


Ala 


Ala 


Tl 0 

lie 


blU 


A»-rr Mot- Tip Hi *? Pho 

m9 net lie ma 1 11c 


Gly Arg 


Pin T.on Pin Ala 


385 










^ Oft 


395 




4 nn 


Met 


Ser 


blU 


P1 i-» 

bin 


j^eu 


Arg Mi y Asp tys u±y 


Lys 


Asn 


ini nld Moll byo 










4UO 


din 






9 10 


Lys 


Met 


T At* 

Leu 


Lys 


Asp 


/\ia fne oer i<eu ueu 


Ala 


Tyr 


oer Asp fro irp 








420 




4^0 








Asn 


Ser 


Pro 


val 


Gly 


Asn bin lieu Asp fro 


He 


Gin 


Arg biu fro vai 






435 












A A tZ 

4 4 0 


Cys 


Ser 


Ala 


Leu 


Asn 


ber Aia lie i»eu biu 


Thr 


His 


Asn lieu fro Jbys 




450 












460 




Gin 


Pro 




T AM 




Ton Ala Mot" (21 v (*ln 


Ala 


Thr 


r?l n Pv<; T.on P 1 \/ 
uxii iicu uiy 


465 










inn 

4 70. 


475 




480 


Leu 


Met 


Ala 


Arg 


Ser 


Gly lie Gly Ser Cys 


Ala 


Phe 


Ala Thr Val Glu 










485 


490 






495 


Asp Tyr 


Leu 


His 


















500 













<210> 104 
<211> 387 
<212> PRT 

<213> Homo sapiens 



<400> 104 



Met 


Ala 


Thr 


Ser 


Gly 


Val Leu Pro Gly Gly 


Gly 


Phe 


Val 


Ala Ser Ala 


1 








5 


10 








15 


Ala 


Ala 


Val 


Ala 


Gly 


Pro Glu Met Gin Thr 


Gly Arg 


Asn 


Asn Phe Val 








20 




25 








30 


He 


Arg 


Arg 


Asn 


Pro 


Ala Asp Pro Gin Arg 


lie 


Pro 


Ser 


Asn Pro Ser 






35 






40 






45 




His 


Arg 


He 


Gin 


Cys 


Ala Ala Gly Tyr Glu 


Gin 


Ser 


Glu 


His Asn Val 




50 








55 




60 






Cys 


Gin 


Asp 


He 


Asp 


Glu Cys Thr Ala Gly 


Thr 


His 


Asn 


Cys Arg Ala 


65 










70 


75 






80 


Asp 


Gin 


Val 


Cys 


He 


Asn Leu Arg Gly Ser 


Phe 


Ala 


Cys 


Gin Cys Pro 










85 


90 








95 


Pro 


Gly 


Tyr 


Gin 


Lys 


Arg Gly Glu Gin Cys 


Val 


Asp 


He 


Asp Glu Cys 








100 




105 








110 


Thr 


He 


Pro 


Pro 


Tyr 


Cys His Gin Arg Cys 


Val 


Asn 


Thr 


Pro Gly Ser 






115 






120 






125 




Phe 


Tyr 


Cys 


Gin 


Cys 


Ser Pro Gly Phe Gin 


Leu 


Ala 


Ala 


Asn Asn Tyr 




130 








135 




140 






Thr 


Cys 


Val 


Asp 


He 


Asn Glu Cys Asp Ala 


Ser 


Asn 


Gin Cys Ala Gin 


145 








150 


155 






160 


Gin 


Cys 


Tyr 


Asn 


He 


Leu Gly Ser Phe He 


Cys 


Gin 


Cys 


Asn Gin Gly 










165 


170 








175 


Tyr 


Glu 


Leu 


Ser 


Ser 


Asp Arg Leu Asn Cys 


Glu 


Asp 


He Asp Glu Cys 






180 




185 








190 
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Arg 


Thr 


Ser 
195 


ser 


Tyr 


Leu Cys 


Gin Tyr Gin Cys Val Asn Glu Pro Gly 
200 205 


Lys 


Phe 


Ser 


Cys 


Met 


Cys Pro Gin Gly Tyr Gin Val Val Arg Ser Arg 




210 








215 


220 , 


Thr 


Cys 


Gin 


Asp 


lie 


Asn Glu 


Cys Glu Thr. Thr Asn Glu Cys Arg Glu 


o o c 

225 










230 


235 240 


Asp 


Glu 


Met 


Cys 


Trp 


Asn Tyr His Gly Gly Phe Arg Cys Tyr Pro Arg 










245 




250 255 


Asn 


Pro 


Cys 


Gin 
260 


Asp 


Pro Tyr 


He Leu Thr Pro Glu Asn Arg Cys Val 
265 270 


Cys 


Pro 


Val 
275 


Ser 


Asn 


Ala Met 


Cys Arg Glu Leu Pro Gin Ser He Val 
280 285 


Tyr 


Lys 


Tyr 


Met 


Ser 


He Arg 


Ser Asp Arg Ser Val Pro Ser Asp He 




290 








295 


300 


Phe 


Gin 


— * • 

He 


Gin 


Ala 


Thr Thr 


He Tyr Ala Asn Thr lie Asn Thr Phe 


305 










310 


315 320 


Arg 


He 


Lys 


Ser 


Gly 
325 


Asn Glu 


Asn Gly Glu Phe Tyr Leu Arg Gin Thr 
330 335 


Ser 


Pro 


Val 


Ser 
340 


Ala 


Met Leu 


Val Leu Val Lys Ser Leu Ser Gly Pro 
345 350 


Arg 


Glu 


His 
355 


He 


Val 


Asp Leu 


Glu Met Leu Thr Val Ser Ser He Gly 
360 . 365 


Thr 


Phe Arg 


Thr 


Ser 


Ser Val 


Leu Arg Leu Thr He He Val Gly Pro 




370 








375 


380 


Phe 


Ser 


Phe 








• 



385 

<210> 105 
<211> 531 
<212> PRT 

<213> Homo sapiens 



<400> 105 



Met 


Ser 


Lys 


Pro 


His 


Ser 


Glu Ala 


Gly Thr Ala Phe 


lie 


Gin Thr Gin 


1 








5 






10 




15 


Gin 


Leu 


His 


Ala 


Ala 


Met Ala Asp Thr Phe Leu Glu 


His 


Met Cys Arg 








20 








25 




30 


Leu 


Asp 


lie 


Asp 


Ser 


Pro 


Pro lie 


Thr Ala Arg Asn Thr Gly lie lie 






35 








40 




45 




Cys 


Thr 


He 


Gly 


Pro 


Ala 


Ser Arg 


Ser Val Glu Thr 


Leu 


Lys Glu Met 




50 










55 


60 






He 


Lys 


Ser 


Gly 


Met 


Asn 


Val Ala 


Arg Leu Asn Phe 


Ser 


His Gly Thr 


65 










70 




75 




80 


His 


Glu 


Tyr 


His 


Ala 


Glu 


Thr He 


Lys Asn Val Arg 


Thr 


Ala Thr Glu 










85 






90 




95 


Ser 


Phe 


Ala 


Ser 


Asp 


Pro 


Tyr Leu 


Tyr Arg Pro Val 


Ala 


Val Ala Leu 








100 








105 




110 


Asp 


Thr 


Lys 


Gly 


Pro 


Glu 


lie Arg 


Thr Gly Leu lie 


Lys 


Gly Ser Gly 






115 








120 




125 




Thr 


Ala 


Glu 


Leu 


Glu 


Leu 


Lys Lys 


Gly Ala Thr Leu 


Lys 


He Thr Leu 




130 










135 


140 






Asp 


Asn 


Ala 


Tyr 


Met 


Glu 


Lys Cys 


Asp Glu Asn lie 


Leu Trp Leu Asp 


145 










150 




155 




160 


Tyr 


Lys 


Asn 


He 


Cys 


Lys 


Val Val 


Glu Val Gly Ser 


Lys 


He Tyr Val 










165 






170 




175 


Asp 


Asp 


Gly 


Leu 


He 


Ser 


Leu Gin 


Val Lys Gin Lys Gly Ala Asp Phe 








180 








185 




190 
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Leu 


vai 


1 LIT 


olU 


vai 


oiu Hsn 


pit* 
uiy 






1 QS 

X J J 








onn 


vai 


TV e n 


Leu 


rro 


uiy 


TV 1 -a 7A 1 a 
Ala nla 


vax 
















ASp 


lie 


oin 


^\ b""^ B^Mh 

ASp 


Leu 


T lie DWa 

Lys «rne 


v>xy 


£.£. D 










230 




Phe 


TV I _ 

Ala 


Ser 


Pne 


lie 


Arg Lys 


Ala 
















Val 


Leu 


Gly 


Glu 


Lys 


Gly Lys 


Asn 








OCA 








Asn 


HIS 


GlU 


Gly 


val 


Arg Arg 


Phe 






«£ fD 








280 


GXy 


lie 


Met 


vai 


Aia 


Arg Gly Asp 




£ y u 








295 




T • »M 

Lys 


val 


Pne 


Leu 


Aia 


Gin Lys 


Met 












310 




Gly 


Lys 


Pro 


Val 


lie 


Cys Ala 


Thr 
















T B 

Lys 


Pro 


Arg 


Pro 


Tnr 


Arg Ala Glu 








•J Aft 








Leu 


Asp 


Gly 


TV 1 

Ala 


TV »m 

Asp 


Cys lie 


Met 






*5 H c 








360 


TV MM* 

ASp 


Tyr 


^« i r» ^>>. 

Pro 


Leu 


GlU 


Ala Val 


Arg 




J / u 








375 




Ala 


GlU 


Ala 


TV I _ 

Ala 


lie 


Tyr His 


Leu 


jo 3 










390 




Leu 


Aia 


pro 


Tl- 
116 


inr 


Ser Asp 


Pro 
















vai 


CjIU 


Aia 


O 

ber 


pne 


Lys Cys 


Cys 








H £. U 








Lys 


O w> 

ber 


tjiy 


Arg 


C? ^"fc V* 

ber 


Ala His 


Gin 






s o j 








440 


O v» 

fro 


lie 


T 1 a 

i ie 


Aia 


vai 


Thr Arg Asn 




sou 








455 




Leu 


lyr 


Arg 


biy 


Ti- 
ne 


Phe Pro 


Val 












470 




Ala 


Trp 


Ala 


Glu 


Asp 


Val Asp 


Leu 










485 






Gly 


Lys 


Ala 


Arg 


Gly 


Phe Phe 


Lys 








500 








Thr 


Gly 


Trp 


Arg 


Pro 


Gly Ser Gly 






515 








520 


Pro 


Val 


Pro 












530 













<210> 106 
<211> 480 
<212> PRT 

<213> Homo sapiens 
<400> 106 

Met Ala Ala Arg Cys Ser Thr Arg 

1 5 
Pro Arg Leu Pro Ala lie Ser Gly 

20 

Gly Val Val Gly Ala Trp Leu Ser 
35 40 
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oiy 


Ser Leu Gly 


Ser 


Lys Lys Gly 








205 




7A T""\ 

ASp 


Leu Pro 


Ala 


Val 


Ser Glu Lys 






220 






vai 


Glu Gin Asp 


Val 


Asp Met Val 




235 






240 


ber 


Asp Val 


His 


Glu Val Arg Lys 




250 






255 


ti- 
ne 


Lys lie 


He 


Ser 


Lys He Glu 


o c n 
£ bo 








270 


ASp 


Glu He 


Leu 


Glu Ala Ser Asp 








285 




Leu 


Gly He 


Giu 


He 


Pro Ala Glu 






300 






Mer. 


He Gly Arg 


Cys 


Asn Arg Ala . 




315 






320 


bin 


Met Leu 


Glu 


Ser 


Met He Lys 




330 






335 


biy 


Ser Asp 


Val 


Ala 


Asn Ala Val 


OH D 








350 


T All 

Leu 


Ser Gly Glu 


Thr" Ala Lys Gly 








365 




Met 


Gin His 


Leu 


lie Ala Arg Glu 






380 






bin 


Leu Phe 


Glu 


Glu 


Leu Arg Arg 




395 






400 


xnr 


Glu Ala 


Thr 


Ala 


Val Gly Ala 




410 






415 


O A V s 

ber 


Gly Ala 


He 


He 


Val Leu Thr 


4 0 c 


■ 






430 


\; a 1 

vai 


Ala Arg Tyr 


Arg 


Pro Arg Ala 








445 




fro 


Gin Thr 


Ala 


Arg Gin Ala His 






460 




* 


Leu 


Cys Lys 


Asp 


Pro 


Val Gin Glu 




475 






480 


Arg 


Val Asn 


Phe 


Ala 


Met Asn Val 




490 






4 95 


Lys 


Gly Asp Val 


Val 


He Val Leu 


505 








510 


Phe 


Thr Asn 


Thr 


Met Arg Val Val 



525 



m 

Trp Leu Leu Val Val Vai Gly Thr 

10 15 
Arg Gly Ala Arg Pro Pro Arg Glu 
25 30 
Arg Lys Leu Ser Val Pro Ala Phe 

45 
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Ala 


ber oer 


Leu 








fro 


Giy vai 


ber 


DO 






Ala 


ber rne 


nlS 


lie 


Leu Giy 


vai 






100 


Tyr 


Tyr Gin 


Leu 




lib 




Pro 


Lys Ala 


Lys 




1 JO 




Leu 


Ser Asp 


GlU 


14 D 






Gly 


Pne Asp 


Pro 


Gly 


Pro Tnr 


val 






180 


Phe 


Ser Ser 


Ser 




1 7J 




bin 


biu iyr 


f ne 




Z 1 u 




Asn 


Lys GlU 


rne 


j 






Giy 


Lys Giy 


Asn 


Gly 


Giy ber 


PI tr 

Giy 






260 


Ser 


Thr Cys 


Arg 




2 / o 




Cys 


val vai 


Cys 




o ft 
290 




Met 


lie Pro 


Val 








^5 V* 

fro 


vai Giy 


Lys 




vai. riic 








340 


Ser 


He Ala 


Gin 




O «-> J 




T* w r 

iyr 


Pi «i Thr 

biu inr 


Tie 




J / w 




T \ r c* 

liys 


Tl n 7\ y* /** 

ixe rirg 


ClcC 


JO J 






iyr 


Giy Hsp 


n is 


Thr 


Ser Arg 


Gin 






420 


Asp 


Val Glu 


Gly 




435 




Ser 


Thr Met 


Asp 




450 




Glu 


Asp Glu 


Glu 



4 65 



Thr Ser Cys Gly 
55 

Leu Thr Gly Thr 
70 

Thr Ser Ala Pro 
85 

Pro Arg Asn Ala 

Ala Lys Lys Tyr 

120 

Glu Lys Phe Ser 
135 

Val Lys Arg Lys 
150 

Gly Ala Ser Gly 
165 

Asp Pro Glu Glu 

Ser Phe Gly Asp 

200 

Met Glu Leu Thr 
215 

Thr Val Asn He 
230 

Glu Pro Gly Thr 
245 

Met Glu Thr He 

Arg Cys Gly Gly 

280 

Arg Gly Ala Gly 
295 

Pro Ala Gly Val 
310 

Arg Glu He Phe 
325 

Arg Asp Gly Ala 

Ala Leu Leu Gly 

360 

Asn Val Thr He 
375 

Gly Gly Lys Gly 
390 

Tyr He His He 
405 

Gin Ser Leu He 

Thr Val Asn Gly 

440 

Ser Ser Ala Gly 
455 

Gly Phe Leu Ser 
470 



O v n 

fro 


Arg 




T a i t 

Leu 








60 


Lys 


His 


Asn 


Pro 






75 




Leu 


Ala 


Lys 


GlU 




90 






C* /S W< 

ber 


Gin 


Lys 


f* 1 ti 
GlU 


100 








HIS 


Pro Asp 


i nr 


Gin 


Leu 


Ala 


P 1 t« 

GlU 








140 


Gin 


Tyr Asp 


Ala 






155 




ber 


Gin 


His 


ber 




170 






Leu 


Phe Arg 


Lys 










f ne 


Gin 


Thr 


vai 


f ne 


Asn 


Gin 


Aid 








220 


Met 


Asp 


Thr 


Cys 






235 




Lys 


Val 


Gin 


nxS 




250 








Thr 


Gly 


fro 










TV ~- Af 

Arg 


Gly 


Ser 


ne 


Gin 


Ala 


Lys 


Gin 




* 




300 


Glu 


Asp Gly 


Gin 






315 




T I « 

lie 


Thr 


Phe 


Arg 




330 






Hop 


He 


His 


Cor 


345 










Thr 


Ala 


t\jm y 


Pro 

trXO 


Pro Gly 


Thr 








380 


He 


Pro Arg 


He 






395 




Lys 


He 


Arg 


Val 




410 






Leu 


Ser 


Tyr 


Ala 


425 








Val 


Thr 


Leu 


Thr 


Ser 


Lys 


Ala 


Arg 



460 

Lys Leu Lys Lys 



Leu Thr Leu Arg 

Phe He Cys Thr 

80 

Asp Tyr Tyr Gin 
95 

lie Lys Lys Ala 
110 

Asn Lys Asp Asp 
125 

Ala Tyr Glu Val 

Tyr Gly Ser Ala 

160 

Tyr Trp Lys Gly 
175 

He Phe Gly Glu 
190 

Phe Asp Gin Pro 
205 

Ala Lys Gly Val 

Glu Arg Cys Asn 

240 

Cys His Tyr Cys 
255 

Phe Val' Met Arg 
270 

He He Ser Pro 
285 

Lys Lys Arg Val 

Thr Val Arg Met 

320 

Val Gin Lys Ser 
335 

Asp Leu Phe He 
350 

Ala Gin Gly Leu 
365 

Gin Thr Asp Gin 

Asn Ser Tyr Gly 

400 

Pro Lys Arg Leu 
415 

Glu Asp Glu Thr 
430 

Ser Ser Gly Gly 
445 

Arg Glu Ala Gly 

Met Phe Thr Ser 

480 



<210> 107 
<211> 572 
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<212> PRT 

<213> Homo sapiens 



<400> 107 



Met 


Ala 


Ala 


Pro 


Arg 


Pro 


Ser 


Pro 


Ala 


He 


Ser 


Val 


Ser 


Val 


Ser 


Ala 


1 








5 










10 










15 




Pro 


Ala 


Phe 


Tyr 
20 


Ala 


Pro 


Gin 


Lys 


Lys. 
25 


Phe 


Gly 


Pro 


Val 


Val 
30 


Ala 


Pro 


Lys 


Pro 


Lys 


Val 


Asn 


Pro 


Phe 


Arg 


Pro Gly Asp 


Ser 


Glu 


Pro 


Pro 


Pro 






35 










40 










45 








Ala 


Pro 


Gly 


Ala 


Gin Arg Ala Gin 


Met 


Gly Arg 


Val 


Gly Glu 


He 


Pro 




50 










55 










60 










Pro 


Pro 


Pro 


Pro 


Glu Asp 


Phe 


Pro 


Leu 


Pro 


Pro 


Pro 


Pro 


Leu Ala Gly 


65 










70 










75 










80 


Asp 


Gly 


Asp 


Asp 


Ala Glu Gly Ala 


Leu Gly Gly 


Ala 


Phe 


Pro 


Pro 


Pro 










85 










90 










95 




Pro 


Pro 


Pro 


lie 
100 


Glu 


Glu 


Ser 


Pne 


Pro 
105 


Pro 


Ala 


Pro 


Leu 


Glu 
110 


Glu 


Glu 


lie 


Phe 


Pro 


Ser 


Pro 


Pro 


Pro 


Pro 


Pro 


Glu 


Glu 


Glu Gly Gly 


Pro 


Glu 






115 










120 










125 








Ala 


Pro 


lie 


Pro 


Pro 


Pro 


Pro 


Gin 


Pro Arg 


Glu 


Lys 


Val 


Ser 


Ser 


He 




130 










IOC 

135 










140 










Asp 


Leu 


Glu 


lie 


Asp 


Ser 


T — - - - 

Leu 


Ser 


Ser 


Leu 


Leu 


Asp Asp 


Met 


Thr 


Lys 


145 










150 










155 








• 


160 


Asn 


Asp 


Pro 


Phe 


Lys 

1 UJ 


Ala 


Arg 


val 


Ser 


Ser 
170 


Gly 


Tyr 


Val 


Pro 


Pro 
175 


Pro 


Val 


Ala 


Thr 


Pro 


f ne 


ber 


ber 


Lys 


Ser 


Ser 


Thr 


Lys 


Pro 


Ala Ala Gly 








180 










185 










190 






Gly 


Thr 


Ala 
195 


Pro 


Leu 


a 

Fro 


T\ M 

fro 


Trp 

2UU 


Lys 


Ser 


Pro 


Ser 


Ser 
205 


Ser 


Gin 


Pro 


• Leu 


Pro 
210 


Gin 


Val 


fro 


Aia 


fro 

215 


H±a 


Gin 


Ser 


Gin 


Thr 
220 


Gin 


Phe 


His 


Val 


Gin 


Pro 


Gin 


Pro 


Gin 


Pro 


Lys 


Pro 


Gin 


Val 


Gin 


Leu 


His 


Val 


Glri Ser 


225 










2JU 










235 










240 


Gin 


Thr 


Gin 


Pro 


Val 


Ser 


Leu 


Ml a 


Asn 


Thr 


Gin 


Pro Arg 


Gly 


Pro 


Pro 










245 










250 










255 




Ala 


Ser 


Ser 


Pro 

260 


Ala 


Pro 


Ala 


Pro 


Lys 
265 


Phe 


Ser 


Pro 


Val 


Thr 
270 


Pro 


Lys 


Phe 


Thr 


Pro 


Val 


Ala 


Ser 


Lys 


Phe 


Ser 


Pro Gly 


Ala 


Pro Gly Gly Ser 






O T C 










280 










285 








Gly 


Ser 


Gin 


Pro 


Asn 


Gin 


Lys 


Leu 


Gly His 


Pro 


Glu 


Ala 


Leu 


Ser 


Ala 














295 










300 










Gly 


Thr 


Gly 


Ser 


Pro 


Gin 


Pro 


Pro 


Ser 


Phe 


Thr 


Tyr Ala 


Gin Gin Arg 


305 










310 










315 










320 


Glu 


Lys 


Pro 


Arg 


Val 


Gin 


Glu 


Lys 


Gin 


His 


Pro 


Val 


Pro 


Pro 


Pro 


Ala 








325 










330 










335 




Gin 


Asn 


Gin 


Asn 


Gin 


Val 


Arg 


Ser 


Pro Gly Ala 


Pro Gly 


Pro 


Leu 


Thr 








340 










345 










350 






Leu 


Lys 


Glu 


Val 


Glu 


Glu 


Leu 


Glu 


Gin 


Leu 


Thr 


Gin 


Gin 


Leu 


Met 


Gin 




355 










360 










365 








Asp 


Met 


Glu 


His 


Pro 


Gin 


Arg 


Gin 


Asn 


Val 


Ala 


Val 


Asn 


Glu 


Leu Cys 




370 










375 










380 










Gly 


Arg 


Cys 


His 


Gin 


Pro 


Leu 


Ala 


Arg Ala 


Gin 


Pro 


Ala 


Val 


Arg 


Ala 


385 










390 










395 










400 


Leu 


Gly 


Gin 


Leu 


Phe 
405 


His 


He 


Ala 


Cys 


Phe 
410 


Thr 


Cys 


His 


Gin 


Cys 
415 


Ala 


Gin 


Gin 


Leu 


Gin 


Gly Gin Gin 


Phe 


Tyr 


Ser 


Leu 


Glu Gly Ala 


Pro 


Tyr 
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420 




425 








a 

sou 




CZ \ vi 

VJXU 


Gly Cys 


lyr 


inr nap iiiL ijtsu 


Gl ii 


Lys 


Cys Asn 


inr cys Gly 






435 






A A 0 






445 




V?1U 


f ITO 


He 


Thr 


A. o r\ 

ASp 


Arg Met Lieu niy 


c\± d. 


Thr Gly Lys 


Aia Tyr His 




1 Jw 














a fin 




r$ ^-x 

r J.0 


nis 


Cys 


Phe 


i nr 


cys vai vai cys 


Aia 


7\ M M> 

Arg 


fro Lieu 


Glu Gly Tnr 












H / u 








AQC\ 
HO\J 


C A V* 


Oho 


He 


Val 


ASp 


bin Aia Asn Arg 


rro 


His 


mm \ 

Cys Val 


Pro Asp Tyr 










A R =i 
4 0 O 




A Qfi 






4 y o 


MIS 


T tic 

L»ys 


Gin 


Tyr 


Ala 


fro Arg cys oer 


Vdi 


Cys 


Ser Glu 


fro lie Met 








500 












01U 


Pro 


Glu 


Pro Gly 


Arg 


Asp Glu Tnr val 


Arg 


Val 


Val Ala 


• 4k w 

Leu Asp Lys 






515 






520 






525 




Asn 


Phe 


His 


Met 


Lys 


Cys Tyr Lys Cys 


Glu 


Asp 


Cys Gly 


Lys Pro Leu 




530 








535 






540 




Ser 


He 


Glu 


Ala 


Asp 


Asp Asn Gly Cys 


Phe 


Pro 


Leu Asp 


Gly His Val 


545 










550 




555 




560 


Leu 


Cys 


Arg 


Lys 


Cys 


His Thr Ala Arg 


Ala 


Gin 


Thr 












565 




570 









<210> 108 

<211> 2861 

<212> PRT 

<213> Homo sapiens 

<400> 108 



Met 


Lys 


Ala 


Met 


Asp 


Val 


Leu 


Pro 


He 


Leu 


Lys 


Glu Lys 


Val 


Ala 


Tyr 


1 








5 










10 










15 




Leu 


Ser 


Gly 


Gly 


Arg 


Asp Lys Arg Gly 


Gly 


Pro 


He 


Leu 


Thr 


Phe 


Pro 








20 










25 










30 






Ala 


Arg 


Ser 


Asn 


His 


Asp Arg 


He 


Arg 


Gin 


Glu 


Asp 


Leu 


Arg 


Arg 


Leu 






35 










40 










45 








He 


Ser 


Tyr 


Leu 


Ala 


Cys 


He 


Pro 


Ser 


Glu 


Glu 


Val 


Cys 


Lys Arg Gly 




50 










55 










60 










Phe 


Thr 


Val 


He 


Val 


Asp Met Arg Gly 


Ser 


Lys 


Trp Asp 


Ser 


He 


Lys 


65 










70 










75 










80 


Pro 


Leu 


Leu 


Lys 


He 


Leu 


Gin 


Glu 


Ser 


Phe 


Pro 


Cys 


Cys 


He 


His 


Val 










85 










90 










95 




Ala 


Leu 


He 


He 


Lys 


Pro Asp Asn 


Phe 


Trp 


Gin 


Lys 


Gin 


Arg 


Thr 


Asn 








100 










105 










110 






Phe 


Gly 


Ser 


Ser 


Lys 


Phe 


Glu 


Phe 


Glu 


Thr 


Asn 


Met 


Val 


Ser 


Leu 


Glu 






115 










120 










125 








Gly 


Leu 


Thr 


Lys 


Val 


Val 


Asp 


Pro 


Ser 


Gin 


Leu 


Thr 


Pro 


Glu 


Phe 


Asp 




130 










135 










140 










Gly 


Cys 


Leu 


Glu 


Tyr 


Asn 


His 


Glu 


Glu 


Trp 


He 


Glu 


He 


Arg 


Val 


Ala 


145 










150 










155 










160 


Phe 


Glu 


Asp 


Tyr 


He 


Ser 


Asn 


Ala 


Thr 


His 


Met 


Leu 


Ser 


Arg 


Leu 


Glu 










165 










170 










175 




Glu 


Leu 


Gin 


Asp 


He 


Leu 


Ala 


Lys 


Lys 


Glu 


Leu 


Pro 


Gin 


Asp 


Leu 


Glu 








180 










185 










190 






Gly 


Ala 


Arg 


Asn 


Met 


He 


Glu 


Glu 


His 


Ser 


Gin 


Leu 


Lys 


Lys 


Lys 


Val 






195 










200 










205 








He 


Lys 


Ala 


Pro 


He 


Glu 


Asp 


Leu Asp 


Leu 


Glu Gly Gin 


Lys 


Leu 


Leu 




210 










215 










220 










Gin 


Arg 


He 


Gin 


Ser 


Ser 


Glu 


Ser 


Phe 


Pro 


Lys 


Lys 


Asn 


Ser Gly Ser 


225 










230 










235 










240 


Gly 


Asn 


Ala 


Asp 


Leu 


Gin 


Asn 


Leu 


Leu 


Pro 


Lys 


Val 


Ser 


Thr 


Met 


Leu 
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Asp Arg 


Leu 


His 








260 


Arg 


Lys 


Leu 


Lys 






0 T C 

275 




Asp Ala 




i»ys 




290 






Leu 


Asn 


Ser 


Tyr 


305 








Leu 


Gin 


Thr 


Gin 


Val* 


Asn 


He 


Asn 








340 


Gly His 


Tyr 


Ala 






355 




Gin 


Glu 


Trp 


Lys 




370 






Leu Asp 


Met 


Ser 


385 








Asn 


Val 


Asp 


Ser 


Glu 


Leu 


Gin 


Asp 








420 


Glu 


His 


Tl- 


l nr 






435 




Leu 


Leu 


Asp 


Lys 




450 






Leu 


Thr 


Ala 


Ser 


465 








Val 


He 


His 


Glu 


Gin 


His 


Arg 


Lys 








500 


Gin 


Gin 


GlU 


val 






515 




Ala 


Phe 


Leu 


C V* 

oer 




530 






Arg Ala 


Leu 


Cain 


545 








Thr Tyr 


i nr 


Asn 


Gin 


Thr 


Gly 


Glu 








580 


Leu 


Glu 


Asp 


Arg 






C ft c 




He 


Leu 


Leu 


ASp 




610 






Trp 


Thr 


Trp 


Leu 


625 








Ala 


Glu 


Ser 


Val 


Gin 


Gin 


Gin 


Thr 








660 


Glu Asp 


Leu 


He 






675 




Thr 


Pro 


His 


Asn 



690 



1 j r 

*4 5 








Ser 


Thr Arg Gin 


Leu 


Asp 


Gin 


Cys 








280 


Met 


Phe Asp Trp 










Thr 


Glu 


He 


Gly 




310 






His 


Asn 


His 


Phe 


325 








Arg 


He 


Met 


Ser 


Ser 


Gin 


Gin 


He 








360 


Ala 


Phe 


Ala 


Ala 






375 




Ser 


He 


Phe 


His 




390 






Trp 


Cys 


Lys 


Ala 


405 








Leu 


Glu Asp Ala 


Leu 


Ala Tyr Ser 








440 


Leu 


Gin Arg 


Pro 






455 




Ala 


Asn Tyr Ser 




470 






Val 


Leu 


His 


His 


485 








Val 


Arg Leu His 


Gin 


Gin 


Val 


Leu 








520 


Lys 


His 


Thr Gly 






535 




Lys 


Arg 


His 


Glu 




550 






Ala 


Asp 


Lys 


Leu 


cue 

565 








Cys 


Asp 


Pro 


Glu 


He 


Gin Asp 


Phe 








600 


Met 


Ser 


Val 


Ser 






615 




Glu 


Glu 


Leu 


Gin 




630 






Glu 


Ala 


Val 


Gin 


645 








Thr 


Leu 


Gin 


Val 


Gin 


Gin 


Leu Arg 








680 


Ser 


Ser 


He 


Asn 






695 







250 






• • • 

His 


Leu 


His 


Gin 


265 








Phe 


Gin 


Leu Arg 


He 


Thr 


His 


Asn 








300 


Thr 


Ser 


His 


Pro 






315 




Ala 


Met 


Asn 


Cys 




330 






• • * 

Val 


Ala 


Asn 


Arg 


345 








Arg 


Gin 


He 


Ala 


Ala 


Leu Asp Glu 








380 


Gin 


Lys 


Ala 


Glu 






395 




Cys 


Gly Glu 


Val 




410 






He 


His 


His 


His 


425 








Glu 


Val 


Ser 


Gin 


Leu 


Thr 


Pro Gly 








460 


Lys 


Ala 


Val 


His 






475 




Gin 


Arg 


His 


Val 




490 






Gin 


Arg Leu Gin 


505 








Asp 


Trp 


He 


Glu 


Val 


Gly Lys 


Ser 








540 


Asp 


Phe 


Glu 


Glu 






555 




Leu 


Glu 


Ala 


Ala 




570 






Glu 


He 


Tyr 


Gin 


CDC 

585 








Val 


Arg Arg 


Val 


Phe 


His 


Thr 


His 








620 


Lys 


Glu 


Leu 


Leu 






635 




Asp 


Leu 


He 


Lys 




650 






Thr 


Val 


Asn 


Val 


665 








Asp 


Ser 


Ala 


lie 


His 


He 


Glu 


Thr 








700 







255 




Met 


Trp 


His 


Val 




270 






Leu 


Phe 


Glu 


Gin 


285 








Lys 


Gly 


Leu 


Phe 


His 


Ala 


Met 


Glu 








320 


Met 


Asn 


Val 


Tyr 






335 




Leu 


Val 


Glu 


Ser 




350 






Ser 


Gin 


Leu 


Glu 


365 








Arg 


Ser 


Thr 


Leu 


Lys 


Tyr 


Met 


Ser 








400 


Asp 


Leu 


Pro 


Ser 






415 




Gin 


Gly 


He 


Tyr 




430 






Asp 


Gly 


Lys 


Ser 


445 








Ser 


Ser 


Asp 


Ser 


His 


Val 


Leu 


Asp 








480 


Arg 


Thr 


He 


Trp 






495 




Leu 


Cys 


Val 


Phe 




510 






Asn 


His 


Gly 


Glu 


525 








Leu 


His 


Arg 


Ala 


Val 


Ala 


Gin 


Asn 








560 


Glu 


Gin 


Leu 


Ala 






575 




Ala 


Ala 


His 


Gin 




590 






Glu 


Gin 


Arg 


Lys 


605 








Val 


Lys 


Glu 


Leu 


Asp 


Asp 


Val 


Tyr 



640 

Arg Phe Gly Gin 
655 

He Lys Glu Gly 
670 

Ser Ser Asn Lys 
685 

Val Leu Gin Gin 
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Leu 


Asp 


— - * 

Glu 


Ala 


Gin 


Ser 


Gin 


Met 


Glu 


Glu Leu 


Phe 


Gin 


Glu 


Arg 


Lys 


705 










710 








715 










720 


He 


Lys 


Leu 


Glu 


Leu 
725 


Phe 


Leu 


His 


Val 


Arg He 
730 


Phe 


Glu 


Arg 


Asp 

~i rt r - 

735 


Ala 


He 


Asp 


He 


He 


Ser Asp 


Leu 


Glu 


Ser 


Trp Asn 


Asp 


Glu 


Leu 


Ser 


Gin 








740 










745 








750 






Gin 


Met 


Asn 


Asp 


Phe Asp 


Thr 


Glu 


Asp 


Leu Thr 


He 


Ala 


Glu 


Gin 


Arg 






755 










760 








765 








Leu 


Gin 
770 


His 


His 


Ala 


Asp 


Lys 
775 


Ala 


Leu 


Thr Met 


Asn 
780 


Asn 


Leu 


Thr 


Phe 


Asp 


Val 


He 


His 


Gin Gly 


Gin Asp 


Leu 


Leu Gin 


Tyr 


Val 


Asn 


Glu 


Val 


785 










M f\ f\ 

790 








795 










800 


Gin 


Ala 


Ser 


Gly 


Val 

a a 

805 


Glu 


Leu 


Leu 


Cys 


Asp Arg 
810 


Asp 


Val 


Asp 


Met 
815 


Ala 


Thr 


Arg 


Val 


Gin 


Asp 


Leu 


Leu 


Glu 


Phe 


Leu His 


Glu 


Lys 


Gin 


Gin 


Glu 






820 










825 








830 






Leu 


Asp 


Leu 
835 


Ala 


Ala 


Glu 


Gin 


His 
840 


Arg 


Lys His 


Leu 


Glu 
845 


Gin 


Cys 


Val 


Gin 


Leu 
850 


Arg 


His 


Leu 


Gin 


Ala 
855 


Glu 


Val 


Lys Gin 


Val 
860 


Leu 


Gly 


Trp 


He 


Arg 


Asn 


Gly 


Glu 


Ser 


Met 


Leu 


Asn 


Ala 


Gly Leu 


He 


Thr 


Ala 


Ser 


Ser 


865 










870 








875 










880 


Leu 


Gin 


Glu 


Ala 


Glu 
885 


Gin 


Leu 


Gin 


Arg 


Glu His 

/\ rt 

890 


Glu 


Gin 


Phe 


Gin 

rt rt ^ 

895 


• * * 

His 


Ala 


He 


Glu 


Lys 


Thr 


His 


Gin 'Ser 


Ala 


Leu Gin 


Val 


Gin 


Gin 


Lys 


— ^ 

Ala 








900 










905 








910 






Glu 


Ala 


Met 
915 


Leu 


Gin 


Ala 


Asn 


His 
920 


Tyr 


Asp Met 


Asp 


Met 
925 


He 


Arg 

♦ 


Asp 


Cys 


Ala 
930 


Glu 


Lys 


Val 


Ala 


Ser 
935 


His 


Trp 


Gin Gin 


Leu 
940 


Met 


Leu 


Lys 


Met 


Glu 


Asp 


Arg 


Leu 


Lys 


Leu 


Val 


Asn 


Ala 


Ser Val 


Ala 


Phe 


Tyr 


Lys 


Thr 


945 










950 








955 










rt /" rt 

960 


Ser 


Glu 


Gin 


Val 


Cys 
965 


Ser 


Val 


Leu 


Glu 


Ser Leu 
970 


Glu 


Gin 


Glu 


Tyr 
975 


Lys 


Arg 


Glu 


Glu 


Asp 


Trp Cys 


Gly Gly 


Ala 


Asp Lys 


Leu 


Gly 


Pro 


Asn 


Ser 








f\. A 

980 










985 








990 






Glu 


Thr 


Asp 


His 


Val 


Thr 


Pro 


Met 


He 


Ser Lys 


His 


Leu 


Glu 


Gin 


Lys 






995 










1000 






1005 






Glu 


Ala 


Phe 


Leu 


Lys 


Ala 


Cys 


Thr 


Leu 


Ala Arg 


Arg 


Asn 


Ala 


Asp 


Val 




1010 








1015 






1020 








Phe 


Leu 


Lys 


Tyr 


Leu 


His 


Arg Asn 


Ser 


Val Asn 


Met 


Pro Gly 


Met 


Val 


1025 








1030 






1035 








1040 


Thr 


His 


He 


Lys 


Ala 


Pro 


Glu 


Gin 


Gin 


Val Lys 


Asn 


He 


Leu 


Asn 


Glu 










1045 








1050 








1055 


Leu 


Phe 


Gin 


Arg 


Glu 


Asn Arg 


Val 


Leu 


His Tyr Trp Thr Met 


Arg Lys 








1060 








1065 






1070 





Arg Arg Leu Asp Gin Cys Gin Gin Tyr Val Val Phe Glu Arg Ser Ala 

1075 1080 1085 

Lys Gin Ala Leu Glu Trp He His Asp Asn Gly Glu Phe Tyr Leu Ser 

1090 1095 1100" 

Thr His Thr Ser Thr Gly Ser Ser He Gin His Thr Gin Glu Leu Leu 
1105 1110 1115 1120 

Lys Glu His Glu Glu Phe Gin He Thr Ala Lys Gin Thr Lys Glu Arg 

1125 1130 1135 

Val Lys Leu Leu He Gin Leu Ala Asp Gly Phe Cys Glu Lys Gly His 

1140 1145 1150 

Ala His Ala Ala Glu He Lys Lys Cys Val Thr Ala Val Asp Lys Arg 
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1155 1160 1165 

Tyr Arg Asp Phe Ser Leu Arg Met Glu Lys Tyr Arg Thr Ser Leu Glu 

1170 1175 1180 

Lys Ala Leu Gly lie Ser Ser Asp Ser Asn Lys Ser Ser Lys Ser Leu 
1185 1190 1195 1200 

Gin Leu* Asp lie lie Pro Ala Ser lie Pro Gly Ser Glu Val Lys Leu 

1205 1210 1215 

Arg Asp Ala Ala His Glu Leu Asn Glu Glu Lys Arg Lys Ser Ala Arg 

1220 1225 1230 

Arg Lys Glu Phe He Met Ala Glu Leu He Gin Thr Glu Lys Ala Tyr 

1235 1240 1245 

Val Arg Asp Leu Arg Glu Cys Met Asp Thr Tyr Leu Trp Glu Met Thr 

1250 1255 1260 

Ser Gly Val Glu Glu He Pro Pro Gly He Val Asn Lys Glu Leu He 
1265 1270 1275 1280 

He Phe Gly Asn Met Gin Glu He Tyr Glu Phe His Asn Asn He Phe 

1285 . 1290 1295 

Leu Lys Glu Leu Glu Lys Tyr Glu Gin Leu Pro Glu Asp Val Gly His 

1300 1305 1310 

Cys Phe Val Thr Trp Ala Asp Lys Phe Gin Met Tyr Val Thr Tyr Cys 

1315 1320 • 1325 

Lys Asn Lys Pro Asp Ser Thr Gin Leu He Leu Glu His Ala Gly Ser 

1330 1335 1340 

Tyr Phe Asp Glu He Gin Gin Arg His Gly Leu Ala Asn Ser lie Ser 
1345 1350 ' 1355 1360- 

Ser Tyr Leu He Lys Pro Val Gin Arg He Thr Lys Tyr Gin Leu Leu 

1365 1370 1375 

Leu Lys Glu Leu Leu Thr Cys Cys Glu Glu Gly Lys Gly Glu He Lys 

1380 1385 1390 

Asp Gly Leu Glu Val Met Leu Ser Val Pro Lys Arg Ala Asn Asp Ala 

1395 1400 1405 

Met His Leu Ser Met Leu Glu Gly Phe Asp Glu Asn He Glu Ser Gin 

1410 1415 1420 

Gly Glu Leu He Leu Gin Glu Ser Phe Gin Val Trp Asp Pro Lys Thr 
1425 1430 1435 1440 

Leu He Arg Lys Gly Arg Glu Arg His Leu Phe Leu Phe Glu Met Ser 

1445 1450 1455 

Leu Val Phe Ser Lys Glu Val Lys Asp Ser Ser Gly Arg Ser Lys Tyr 

1460 1465 1470 

Leu Tyr Lys Ser Lys Leu Phe Thr Ser Glu Leu Gly Val Thr Glu His 

1475 1480 1485 

Val Glu Gly Asp Pro Cys Lys Phe Ala Leu Trp Val Gly Arg Thr Pro 

1490 1495 1500 

Thr Ser Asp Asn Lys He Val Leu Lys Ala Ser Ser He Glu Asn Lys 
1505 1510 1515 1520 

Gin Asp Trp He Lys His He Arg Glu Val He Gin Glu Arg Thr He 

1525 1530 1535 

His Leu Lys Gly Ala Leu Lys Glu Pro He His He Pro Lys Thr Ala 

1540 1545 1550 

Pro Ala Thr Arg Gin Lys Gly Arg Arg Asp Gly Glu Asp Leu Asp Ser 

1555 1560 1565 

Gin Gly Asp Gly Ser Ser Gin Pro Asp Thr He Ser He Ala Ser Arg 

1570 1575 1580 

Thr Ser Gin Asn Thr Leu Asp Ser Asp Lys Leu Ser Gly Gly Cys Glu 
1585 1590 1595 1600 

Leu Thr Val Val He His Asp Phe Thr Ala Cys Asn Ser Asn Glu Leu 

1605 1610 1615 
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Thr He Arg Arg Gly Gin Thr Val Glu Val Leu Glu Arg Pro His Asp 

1620 1625 1630 

Lys Pro Asp Trp Cys Leu Val Arg Thr Thr Asp Arg Ser Pro Ala Ala 

1635 1640 1645 

Glu Gly Leu Val Pro Cys Gly Ser Leu Cys He Ala His Ser Arg Ser 

1650 1655 1660 

Ser Met Glu Met Glu Gly He Phe Asn His Lys Asp Ser Leu Ser Val 
1665 1670 1675 1680 

Ser Ser Asn Asp Ala Ser Pro Pro Ala Ser Val Ala Ser Leu Gin Pro 

1685 1690 1695 

His Met He Gly Ala Gin Ser Ser Pro Gly Pro Lys Arg Pro Gly Asn 

1700 1705 1710 

Thr Leu Arg Lys Trp Leu Thr Ser Pro Val Arg Arg Leu Ser Ser Gly 

1715 1720 1725 

Lys Ala Asp Gly His Val Lys Lys Leu Ala His Lys His Lys Lys Ser 

1730 1735 1740 

Arg Glu Val Arg Lys Ser Ala Asp Ala Gly Ser Gin Lys Asp Ser Asp 
1745 1750 1755 1760 

Asp Ser Ala Ala Thr Pro Gin Asp Glu Thr Val Glu Glu Arg Gly Arg 

1765 1770 1775 

Asn Glu Gly Leu Ser Ser Gly Thr Leu Ser Lys Ser Ser Ser Ser Gly 

1780 1785 1790 

Met Gin Ser Cys Gly Glu Glu Glu Gly Glu Glu Gly Ala Asp Ala Val 

1795 1800 1805 

Pro Leu Pro Pro Pro Met Ala He Gin Gin His Ser Leu Leu Gin Pro 

1810 1815 1820 

Asp Ser Gin Asp Asp Lys Ala Ser Ser Arg Leu Leu Val Arg Pro Thr 
1825 1830 1835 1840 

Ser Ser Glu Thr Pro Ser Ala Ala Glu Leu Val Ser Ala He Glu Glu 

1845 1850 1855 

Leu Val Lys Ser Lys Met Ala Leu Glu Asp Arg Pro Ser Ser Leu Leu 

1860 1865 1870 

Val Asp Gin Gly Asp Ser Ser Ser Pro Ser Phe Asn Pro Ser Asp Asn 

1875 1880 1885 

Ser Leu Leu Ser Ser Ser Ser Pro He Asp Glu Met Glu Glu Arg Lys 

1890 1895 1900 

Ser Ser Ser Leu Lys Arg Arg His Tyr Val Leu Gin Glu Leu Val Glu 
1905 1910 1915 1920 

Thr Glu Arg Asp Tyr Val Arg Asp Leu Gly Tyr Val Val Glu Gly Tyr 

1925 1930 1935 

Met Ala Leu Met Lys Glu Asp Gly Val Pro Asp Asp Met Lys Gly Lys 

1940 1945 1950 

Asp Lys lie Val Phe Gly Asn He His Gin He Tyr Asp Trp His Arg 

1955 1960 1965 

Asp Phe Phe Leu Gly Glu Leu Glu Lys Cys Leu Glu Asp Pro Glu Lys 

1970 1975 1980 

Leu Gly Ser Leu Phe Val Lys His Glu Arg Arg Leu His Met Tyr He 
1985 1990 1995 2000 

Ala Tyr Cys Gin Asn Lys Pro Lys Ser Glu His He Val Ser Glu Tyr 

2005 2010 2015 

He Asp Thr Phe Phe Glu Asp Leu Lys Gin Arg Leu Gly His Arg Leu 

2020 2025 2030 

Gin Leu Thr Asp Leu Leu He Lys Pro Val Gin Arg He Met Lys Tyr 

2035 2040 2045 

Gin Leu Leu Leu Lys Asp Phe Leu Lys Tyr Ser Lys Lys Ala Ser Leu 

2050 2055 2060 

Asp Thr Ser Glu Leu Glu Arg Ala Val Glu Val Met Cys He Val Pro 
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2065 2070 2075 2080 

Arg Arg Cys Asn Asp Met Met Asn Val Gly Arg Leu Gin Gly Phe Asp 

2085 2090 2095 

Gly Lys lie Val Ala Gin Gly Lys Leu Leu Leu Gin Asp Thr Phe Leu 

2100 2105 2110 

Val Thr Asp Gin Asp Ala Gly Leu Leu Pro Arg Cys Arg Glu Arg Arg 

2115 2120 2125 

lie Phe Leu Phe Glu Gin lie Val lie Phe Ser Glu Pro Leu Asp Lys 

2130 2135 2140 

Lys Lys Gly Phe Ser Met Pro Gly Phe Leu Phe Lys Asn Ser lie Lys 
2145 2150 2155 2160 

Val Ser Cys Leu Cys Leu Glu Glu Asn Val Glu Asn Asp Pro Cys Lys 

2165 2170 2175 

Phe Ala Leu Thr Ser Arg Thr Gly Asp Val Val Glu Thr Phe lie Leu 

2180 2185 2190 

His Ser Ser Ser Pro Ser Val Arg Gin Thr Trp lie His Glu lie Asn 

2195 2200 2205 

Gin lie Leu Glu Asn Gin Arg Asn Phe Leu Asn Ala Leu Thr Ser Pro 

2210 2215 2220 

He Glu Tyr Gin Arg Asn His Ser Gly Gly Gly Gly Gly Gly Gly Ser 
2225 2230 2235 2240 

Gly Ala Ala Ala Gly Val Gly Ala Ala- Ala Ala Ala Gly Pro Pro Val 

2245 2250 2255 

Ala Ala Ala Ala Thr Val Ala Ala Pro Ala Ala Ala Ala Ala Pro Pro 

2260 2265 2270 

Ala Arg Ala Gly Ala Gly Pro Pro Gly Ser Pro Ser Leu Ser Asp Thr 

2275 2280 2285 

Thr Pro Pro Cys Trp Ser Pro Leu Gin Pro Arg Ala Arg Gin Arg Gin 

2290 2295 2300 

Thr Arg Cys Gin Ser Glu Ser Ser Ser Ser Ser Asn He Ser Thr Met 
2305 2310 2315 2320 

Leu Val Thr His Asp Tyr Thr Ala Val Lys Glu Asp Glu He Asn Val 

2325 2330 2335 

Tyr Gin Gly Glu Val Val Gin He Leu Ala Ser Asn Gin Gin Asn Met 

2340 2345 2350 

Phe Leu Val Phe Arg Ala Ala Thr Asp Gin Cys Pro Ala Ala Glu Gly 

2355 2360 2365 

Trp lie Pro Gly Phe Val Leu Gly His Thr Ser Ala Val He Val Glu 

2370 2375 2380 

Asn Pro Asp Gly Thr Leu Lys Lys Ser Thr Ser Trp His Thr Ala Leu 
2385 2390 2395 2400 

Arg Leu Arg Lys Lys Ser Glu Lys Lys Asp Lys Asp Gly Lys Arg Glu 

2405 2410 2415 

Gly Lys Leu Glu Asn Gly Tyr Arg Lys Ser Arg Glu Gly Leu Ser Asn 

2420 2425 2430 

Lys Val Ser Val Lys Leu Leu Asn Pro Asn Tyr He Tyr Asp Val Pro 

2435 2440 2445 

Pro Glu Phe Val He Pro Leu Ser Glu Val Thr Cys Glu Thr Gly Glu 

2450 2455 2460 

Thr Val Val Leu Arg Cys Arg Val Cys Gly Arg Pro Lys Ala Ser He 
2465 2470 2475 2480 

Thr Trp Lys Gly Pro Glu His Asn Thr Leu Asn Asn Asp Gly His Tyr 

2485 2490 . 2495 

Ser He Ser Tyr Ser Asp Leu Gly Glu Ala Thr Leu Lys He Val Gly 

2500 2505 2510 

Val Thr Thr Glu Asp Asp Gly He Tyr Thr Cys He Ala Val Asn Asp 
2515 2520 2525 
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Met Gly Ser Ala Ser Ser Ser Ala Ser Leu Arg Val Leu Gly Pro Gly 

2530 2535 2540 

Met Asp Gly lie Met Val Thr Trp Lys Asp Asn Phe Asp Ser Phe Tyr 
2545 2550 2555 2560 

Ser Glu .Val Ala Glu Leu Gly Arg Gly Arg Phe Ser Val Val Lys Lys 

2565 2570 2575 

Cys Asp Gin Lys Gly Thr Lys Arg Ala Val Ala Thr Lys Phe Val Asn 

2580 2585 2590 

Lys Lys Leu Met Lys Arg Asp Gin Val Thr His Glu Leu Gly lie Leu 

2595 2600 2605 

Gin Ser Leu Gin His Pro Leu Leu Val Gly Leu Leu Asp Thr Phe Glu 

2610 2615 2620 

Thr Pro Thr Ser Tyr lie Leu Val Leu Glu Met Ala Asp Gin Gly Arg 
2625 2630 2635 2640 

Leu Leu Asp Cys Val Val Arg Trp Gly Ser Leu Thr Glu Gly Lys lie 

2645 2650 2655 

Arg Ala His Leu Gly Glu Val Leu Glu Ala Val Arg Tyr Leu His Asn 

2660 2665 2670 

Cys Arg lie Ala His Leu Asp Leu Lys Pro Glu Asn lie Leu Val Asp 

2675 2680 2685 

Glu Ser Leu Ala Lys Pro Thr lie Lys Leu Ala Asp Phe Gly Asp Ala 

2690 2695 2700 

Val Gin Leu Asn Thr Thr Tyr Tyr lie His Gin Leu Leu Gly Asn Pro 
2705 2710 2715 2720 

Glu Phe Ala Ala Pro Glu lie lie Leu Gly Asn Pro Val Ser Leu Thr 

2725 2730 2735 

Ser Asp Thr Trp Ser Val Gly Val Leu Thr Tyr Val Leu Leu Ser Gly 

2740 2745 2750 

Val Ser Pro Phe Leu Asp Asp Ser Val Glu Glu Thr Cys Leu Asn lie 

2755 2760 2765 

Cys Arg Leu Asp Phe Ser Phe Pro Asp Asp Tyr Phe Lys Gly Val Ser 

2770 2775 2780 

Gin Lys Ala Lys Glu Phe Val Cys Phe Leu Leu Gin Glu Asp Pro Ala 
2785 2790 2795 2800 

Lys Arg Pro Ser Ala Ala Leu Ala Leu Gin Glu Gin Trp Leu Gin Ala 

2805 2810 2815 

Gly Asn Gly Arg Ser Thr Gly Val Leu Asp Thr Ser Arg Leu Thr Ser 

2820 2825 2830 

Phe lie Glu Arg Arg Lys His Gin Asn Asp Val Arg Pro lie Arg Ser 

2835 2840 2845 

lie Lys Asn Phe Leu Gin Ser Arg Leu Leu Pro Arg Val 
2850 2855 2860 

<210> 109 
<211> 271 
<212> PRT 

<213> Homo sapiens 
<400> 109 

Met Val Leu He Lys Glu Phe Arg Val Val Leu Pro Cys Ser Val Gin 

15 10 15 

Glu Tyr Gin Val Gly Gin Leu Tyr Ser Val Ala Glu Ala Ser Lys Asn 

20 25 30 

Glu Thr Gly Gly Gly Glu Gly He Glu Val Leu Lys Asn Glu Pro Tyr 

35 40 45 

Glu Lys Asp Gly Glu Lys Gly Gin Tyr Thr His Lys He Tyr His Leu 
50 55 60 
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Lys 


Ser 


Lys 


Val 


Pro 


Ala 


Phe 


Val 


65 










70 






Leu 


Val 


Phe 


His 


Glu Lys Ala Trp 










85 








He 


Val 


mmm ft 

Thr 


Asn 
100 


Glu 


Tyr 


Met 


Lys 


Thr 


Trp 


mm • 

His 


Lys 


Pro Asp Leu Gly 






115 










120 


Asp 


Pro 
130 


Asn 


Thr 


Trp 


Lys 


Thr 
135 


Val 


Asp 


Arg 


Ser 


Gin 


Val 


Glu 


Pro 


Ala 


145 










150 






Ala 


Leu 


Phe 


Gin 


Ser 
165 


Val 


Lys 


Thr 


Trp 


Lys 


Lys 


Glu 
180 


Leu 


Ala 


Asn 


Ser 


Tyr 


Lys 


Leu 
195 


Val 


Thr 


He 


Lys 


Phe 
200 


Val 


Glu 
210 


Asn 


Phe 


He 


Gin 


Lys 
215 


Gin 


His 


Arg 


Gin 


Leu 


Phe 


Cys 


Trp 


He 


225 










230 






Glu 


Asp 


He 


Arg 


Arg Met 


Glu Asp 










245 








Met 


Arg 


Lys 


Arg 


Gly Ser 


Val 


Arg 



260 



<210> 110 
<211> 233 
<212> PRT 

<213> Homo sapiens 



<400> 110 



Asn 


Ser 


Val 


Leu 


Asn Ser Asn Ala 


1 








5 


Gly 


Gly 


Ala 


Ala 


Gly His Pro Gly 








20 




He 


Leu 


Tyr 


Pro 


Gly Gly Asn Lys 






35 




40 


Pro 


Tyr 


Pro 


Cys 


Ala Glu Asp Glu 




50 






55 


Ala 


Ser 


Pro 


Thr 


Arg Gly Gly Asp 


65 








70 


Cys 


Arg 


Lys 


Arg 


Arg Lys Arg Cys 










85 


Gly 


Asn 


Tyr 


Cys 


Lys Asn Gly He 








100 




Phe 


Arg 


Gly 


Glu 


He Glu Glu Thr 






115 




120 


His 


Ser 


Thr 


Leu 


Asp Gly Tyr Ser 




130 






135 


Met 


Tyr 


His 


Thr 


Lys Gly Gin Glu 


145 








150 


Asp 


Cys 


Ala 


Ser 


Gly Leu Cys Cys 










165 


Cys 


Lys 


Pro 


Val 


Leu Lys Glu Gly 



180 



Arg 


Met 


He 


Ala 


Pro Glu Gly Ser 






75 








80 


Asn 


Ala 


Tyr 


Pro 


Tyr Cys Arg Thr 




90 










95 


Asp 


Asp 


Phe 


Phe 


He 


Lys 


He Glu 


105 


■ 








110 




Thr 


Leu 


Glu 


Asn 


Val 


His 


Gly Leu 










125 






Glu 


He 


Val 


His 


He 


Asp 


He Ala 








140 








Asp 


Tyr 


Lys 


Ala 


Asp Glu Asp Pro 






155 








160 


Lys 


Arg 


Gly 


Pro 


Leu 


Gly 


Pro Asn 




170 










175 


Pro 


Asp Cys 


Pro 


Gin 


Met 


Cys Ala 


185 










190 




Lys 


Trp 


Trp 


Gly 


Leu 


Gin 


Ser Lys 










205 






Glu 


Lys 


Arg 


He 


Phe 


Thr 


Asn Phe 








220 








Asp 


Lys 


Trp 


He 


Asp 


Leu 


Thr Met 






235 








240 


Glu 


Thr 


Gin 


Lys 


Glu 


Leu 


Glu Thr 




250 










255 


Gly 


Thr 


Ser 


Ala 


Ala 


Asp 


Val 


265 










270 





He 


Lys Asn 


Leu 


Pro 


Pro Pro Leu 




10 








15 


Ser 


Ala 


Val 


Ser 


Ala 


Ala Pro Gly 


25 










30 


Tyr 


Gin 


Thr 


He 


Asp Asn Tyr Gin 










45 




Glu 


Cys 


Gly Thr 


Asp Glu Tyr Cys 








60 






Ala 


Gly Val 


Gin 


He 


Cys Leu Ala 






75 






80 


Met 


Arg His Ala 


Met 


Cys Cys Pro 




90 








95 


Cys 


Val 


Ser 


Ser 


Asp 


Gin Asn His 


105 










110 


He 


Thr 


Glu 


Ser 


Phe 


Gly Asn Asp 










125 




Arg 


Arg 


Thr 


Thr 


Leu 


Ser Ser Lys 








140 






Gly 


Ser 


Val 


Cys 


Leu Arg Ser Ser 






155 






160 


Ala 


Arg 


His 


Phe 


Trp 


Ser Lys He 




170 








175 


Gin 


Val 


Cys 


Thr 


Lys 


His Arg Arg 


185 










190 
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Lys Gly Ser His 
195 

Gly Leu Ser Cys 
210 

Ser Arg Leu His 
225 



Gly Leu Glu lie 

200 

Arg lie Gin Lys 
215 

Thr Cys Gin Arg 
230 



Phe Gin Arg Cys 

Asp His His Gin 

220 

His 



Tyr Cys Gly Glu 
205 

Ala Ser Asn Ser 



<210> 111 
<211> 212 
<212> PRT 
<213> Homo sapiens 



<400> 111 



Asn 


Ser 


Val 


Leu 


Asn 


Ser Asn Ala 


1 








5 




Gly 


Gly 


Ala 


Ala 


Gly 


His Pro Gly 








20 






lie 


Leu 


Tyr 


Pro 


Gly 


Gly Asn Lys 






35 






40 


Pro 


Tyr 


Pro 


Cys 


Ala 


Glu Asp Glu 




50 








55 

• 


Ala 


Ser 


Pro 


Thr 


Arg 


Gly Gly Asp 


65 










70 


Cys 


Arg 


Lys 


Arg 


Arg 


Lys Arg Cys 










85 




Gly 


Asn 


Tyr 


Cys 


Lys 


Asn Gly He 








100 






Phe 


Arg 


Gly 


Glu 


He 


Glu Glu Thr 






115 






120 


His 


Ser 


Thr 


Leu 


Asp 


Gly Tyr Ser 




130 








135 


Met 


Tyr 


His 


Thr 


Lys 


Gly Gin Glu 


145 










150 


Asp 


Cys 


Ala 


Ser 


Gly 


Leu Cys Cys 










165 




Cys 


Lys 


Pro 


Val 


Leu 


Lys Glu Gly 








180 






Lys 


Gly 


Ser 


His 


Gly 


Leu Glu He 






195 






200 


Gly 


Leu 


Ser 


Cys 








210 


• 









He Lys Asn 


Leu 


Pro 


Pro 


Pro 


Leu 


10 








15 




Ser Ala Val 


Ser 


Ala 


Ala 


Pro Gly 


25 






30 






Tyr Gin Thr 


He 


Asp 


Asn 


Tyr 


Gin 






45 








Glu Cys Gly Thr 


Asp 


Glu 


Tyr 


Cys 




60 










Ala Gly Val 


Gin 


He 


Cys 


Leu 


Ala 


75 










80 


Met Arg His 


Ala 


Met 


Cys 


Cys 


Pro 


90 








95 




Cys Val Ser 


Ser 


Asp 


Gin 


Asn 


His 


105 






110 






He Thr Glu 


Ser 


Phe 


Gly 


Asn Asp 






125 








Arg Arg Thr Thr 


Leu 


Ser 


Ser 


Lys 




140 










Gly Ser Val 


Cys 


Leu 


Arg 


Ser 


Ser 


155 










160 


Ala Arg His 


Phe 


Trp 


Ser 


Lys 


He 


170 








175 




Gin Val Cys 


Thr 


Lys 


His 


Arg 


Arg 


185 






190 






Phe Gin Arg 


Cys 


Tyr 


Cys 


Gly Glu 






205 









<210> 112 
<211> 149 
<212> PRT 
<213> Homo sapiens 

<400> 112 

Asn Ser Val Leu Asn Ser Asn Ala 

1 5 
Gly Gly Ala Ala Gly His Pro Gly 

20 

He Leu Tyr Pro Gly Gly Asn Lys 

35 40 
Pro Tyr Pro Cys Ala Glu Asp Glu 



He Lys Asn Leu 


Pro 


Pro 


Pro 


Leu 


10 






15 




Ser Ala Val Ser 


Ala 


Ala 


Pro 


Gly 


25 




30 




Tyr Gin Thr He 


Asp 


Asn 


Tyr 


Gin 




45 








Glu Cys Gly Thr 


Asp 


Glu 


Tyr 


Cys 
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50 


55 


60 










Ala 


Ser Pro Thr Arg 


Gly Gly Asp Ala 


Gly Val Gin 


He 


Cys 


Leu 


Ala 


65 




70 


75 








80 


Cys 


Arg Lys Arg Arg 


Lys Arg Cys Met 


Arg His Ala 


Met 


Cys 


Cys 


Pro 




85 




90 






95 




Gly 


Asn Tyr Cys Lys 


Asn Gly He Cys 


Val Ser Ser 


Asp 


Gin 


Asn 


His 




100 


105 






110 






Phe 


Arg Gly Glu He 


Glu Glu Thr He 


Thr Glu Ser 


Phe 


Gly Asn Asp 




115 


120 * 




125 








His 


Ser Thr Leu Asp 


Gly Tyr Ser Arg 


Arg Thr Thr 


Leu 


Ser 


Ser 


Lys 




130 


135 


140 










Met 


Tyr His Thr Lys 














145 

















<210> 113 
<211> 170 
<212> PRT 

<213> Homo sapiens 



<400> 113 


















Cys 


Ala 


Ser 


Pro 


Thr 


Arg Gly Gly Asp 


Ala Gly Val 


Gin 


He 


Cys 


Leu 


1 


* 






5 




10 






15 




Ala 


Cys 


Arg 


Lys 
20 


Arg 


Arg Lys Arg Cys 

25 


Met Arg His 


Ala 


Met 
30 


Cys 


Cys 


Pro 


Gly 


Asn 


Tyr 


Cys 


Lys Asn Gly He 


Cys Val Ser 


Ser 


Asp 


Gin 


Asn 






35 






40 




45 








His 


Phe 


Arg Gly 


Glu 


He Glu Glu Thr 


He Thr Glu 


Ser 


Phe Gly Asn 




50 








55 


60 










Asp 


His 


Ser 


Thr 


Leu 


Asp Gly Tyr Ser 


Arg Arg Thr 


Thr 


Leu 


Ser 


Ser 


65 










70 


75 








80 


Lys 


Met 


Tyr 


His 


Thr 
85 


Lys Gly Gin Glu 


Gly Ser Val 
90 


Cys 


Leu 


Arg 
95 


Ser 


Ser 


Asp 


Cys 


Ala 
100 


Ser 


Gly Leu Cys Cys 

105 


Ala Arg His 


Phe 


Trp 
110 


Ser 


Lys 


He 


Cys 


Lys 


Pro 


Val 


Leu Lys Glu Gly 


Gin Val Cys 


Thr 


Lys 


His Arg 






115 






120. 




125 








Arg 


Lys 


Gly Ser 


His 


Gly Leu Glu He 


Phe Gin Arg Cys 


Tyr 


Cys 


Gly 




130 








135 


140 










Glu 


Gly 


Leu 


Ser 


Cys 


Arg He Gin Lys 


Asp His His 


Gin 


Ala 


Ser 


Asn 


145 










150 


155 • 








160 


Ser 


Ser 


Arg 


Leu 


His 
165 


Thr Cys Gin Arg 


His 
170 











<210> 114 
<211> 128 
<212> PRT 

<213> Homo sapiens 



<400> 114 
Val Ser Ser Asp 
1 

Thr Glu Ser Phe 

20 

Arg Thr Thr Leu 



Gin Asn His Phe 
5 

Gly Asn Asp His 

Ser Ser Lys Met 

40 



Arg Gly Glu He 
10 

Ser Thr Leu Asp 
25 

Tyr His Thr Lys 



Glu Glu Thr He 
15 

Gly Tyr Ser Arg 
30 

Gly Gin Glu Gly 
45 
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Ser 


Val 


Cys 


Leu 


Arg 


Ser Ser Asp 




50 








55 


Arg 


His 


Phe 


Trp 


Ser 


Lys lie Cys 


65 










70 


Val 


Cys 


Thr 


Lys 


His 


Arg Arg Lys 










85 




Gin 


Arg 


Cys 


Tyr Cys 


Gly Glu Gly 








100 






His 


His 


Gin 


Ala 


Ser 


Asn Ser Ser 






115 






120 



Cys Ala Ser Gly Leu Cys Cys Ala 

60 

Lys Pro Val Leu Lys Glu Gly Gin 

75 80 
Gly Ser His Gly Leu Glu He Phe 

90 95 
Leu Ser Cys Arg He Gin Lys Asp 
105 110 
Arg Leu His Thr Cys Gin Arg His 

125 



<210> 115 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<400> 115 



Gly Gin Glu 


Gly Ser 


Val 


Cys 


Leu 


1 


5 








Leu Cys Cys 


Ala Arg 


His 


Phe 


Trp 




20 








Lys Glu Gly 


Gin Val 


Cys 


Thr Lys 


35 








40 


Leu Glu He 


Phe Gin 


Arg 


Cys 


Tyr 


50 






55 




lie Gin Lys 


Asp His 


His 


Gin 


Ala 


65 




70 






Cys Gin Arg 


His 









Arg Ser Ser Asp Cys Ala Ser Gly 

10 15 
Ser Lys He Cys Lys Pro Val Leu 
25 30 
His Arg Arg Lys Gly Ser His Gly 

45 

Cys Gly Glu Gly Leu Ser Cys Arg 

60 

Ser Asn Ser Ser Arg Leu His Thr 
75 80 



<210> 116 
<211> 149 
<212> PRT 

<213> Homo sapiens 
<400> 116 



Cys 


Ala 


Ser 


Pro 


Thr 


Arg 


Gly Gly Asp 


Ala Gly Val Gin He Cys Leu 


1 








5 






10 


15 


Ala 


Cys 


Arg 


Lys 


Arg 


Arg 


Lys Arg Cys 


Met Arg His 


Ala Met Cys Cys 








20 






25 




30 


Pro 


Gly 


Asn 


Tyr 


Cys 


Lys 


Asn Gly He 


Cys Val Ser 


Ser Asp Gin Asn 






35 








40 




45 


His 


Phe 


Arg 


Gly 


Glu 


He 


Glu Glu Thr 


He Thr Glu 


Ser Phe Gly Asn 




50 










55 


60 




Asp 


His 


Ser 


Thr 


Leu 


Asp 


Gly Tyr Ser 


Arg Arg Thr 


Thr Leu Ser Ser 


65 










70 




75 


80 


Lys 


Met 


Tyr 


His 


Thr 


Lys 


Gly Gin Glu 


Gly Ser Val 


Cys Leu Arg Ser 










85 






90 


95 


Ser 


Asp 


Cys 


Ala 


Ser 


Gly 


Leu Cys Cys 


Ala Arg His 


Phe Trp Ser Lys 








100 






105 




110 


He 


Cys 


Lys 


Pro 


Val 


Leu Lys Glu Gly 


Gin Val Cys 


Thr Lys His Arg 






115 








120 




125 


Arg 


Lys 


Gly 


Ser 


His 


Gly 


Leu Glu He 


Phe Gin Arg Cys Tyr Cys Gly 




130 










135 


140 




Glu 


Gly 


Leu 


Ser 


Cys 











145 
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<210> 117 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<400> 117 



Val 


Ser 


Ser 


Asp 


Gin Asn His Phe Arg Gly Glu He Glu Glu Thr He 


1 








5 


10 * 


15 


Thr 


Glu 


Ser 


Phe 


Gly Asn Asp His Ser 


Thr 


Leu Asp Gly Tyr Ser Arg 








20 


25 




30 


Arg 


Thr 


Thr 


Leu 


Ser Ser Lys Met Tyr 


His 


Thr Lys Gly Gin Glu Gly 




35 




40 




45 


Ser 


val 


Cys 


Leu 


Arg Ser Ser Asp Cys 


Ala 


Ser Gly Leu Cys Cys Ala 




50 






55 




60 


Arg 


His 


Phe 


Trp 


Ser Lys He Cys Lys 


Pro 


Val Leu Lys Glu Gly Gin 


65 








70 




75 80 


Val 


Cys 


Thr 


Lys 


His Arg Arg Lys Gly 


Ser 


His Gly Leu Glu He Phe 










85 


90 


95 


Gin 


Arg 


Cys 


Tyr 


Cys Gly Glu Gly Leu 


Ser 


Cys 








100 


105 







<210> 118 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 118 

Gly Asn Lys Tyr Gin Thr He Asp Asn Tyr Gin Pro Tyr Pro Cys 
15. 10 15 



<210> 119 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 119 

Leu Asp Gly Tyr Ser Arg Arg Thr Thr Leu Ser Ser Lys Met Tyr His 

15 10 15 

Thr Lys Gly Gin Glu Gly 

20 



<210> 120 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 120 

Arg He Gin Lys Asp His His Gin Ala Ser Asn Ser Ser Arg Leu His 

15 10 15 

Thr Cys Gin Arg His 

20 
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<210> 121 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 121 

Arg Gly Glu lie Glu Glu Thr lie Thr Glu Ser Phe Gly Asn Asp 
15 10 15 



<210> 122 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 122 

Glu lie Phe Gin Arg Cys Tyr Cys Gly Glu Gly Leu Ser Cys Arg lie 

15 10 15 

Gin Lys Asp 



<210> 123 

<211> 13 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LRP5/HBM amino acid sequence 

<400> 123 

Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg lie Glu 
1 5 10 



<210> 124 

<211> 13 

<212> PRT 

<213> Artificial Sequence 



<400> 124 

Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg lie Glu 



<210> 125 

<211> 13 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LRP5/HBM amino acid sequence 



<220> 
<223> 



LRP5/HBM amino acid sequence 



1 



10 
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<400> 125 

Lys Arg Thr Gly Gly Lys Arg Lys Glu lie Leu Ser Ala 
15 10 



<210> 126 
<211> 17 
<212> PRT 

<213> Artificial Sequence 

■ 

<220> 

<223> LRP5/HBM amino acid sequence 
<400> 126 

Glu Arg Val Glu Lys Thr Thr Gly Asp Lys Arg Thr Arg lie Gin Gly 

15 10 15 

Arg 



<210> 127 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LRP5/HBM amino acid sequence 
<400> 127 

Lys Gin Gin Cys Asp Ser Phe Pro Asp Cys lie Asp Gly Ser Asp Glu 
1 5 10 15 



<210> 128 
<211> 266 
<212> PRT 

<213> Homo sapiens 



<400> 128 



Met 


Met 


Ala 


Leu 


Gly 


Ala Ala Gly Ala 


Thr Arg Val Phe Val Ala Met 


1 








5 




10 15 


Val 


Ala 


Ala 


Ala 


Leu 


Gly Gly His Pro 


Leu Leu Gly Val Ser Ala Thr 








20 




25 


30 


Leu 


Asn 


Ser 


Val 


Leu 


Asn Ser Asn Ala 


He Lys Asn Leu Pro Pro Pro 






35 






40 


45 


Leu 


Gly 


Gly 


Ala 


Ala 


Gly His Pro Gly 


Ser Ala Val Ser Ala Ala Pro 




50 








55 


60 


Gly 


He 


Leu 


Tyr 


Pro 


Gly Gly Asn Lys 


Tyr Gin Thr He Asp Asn Tyr 


65 










70 


75 80 


Gin 


Pro 


Tyr 


Pro 


Cys 


Ala Glu Asp Glu 


Glu Cys Gly Thr Asp Glu Tyr 










85 




90 95 


Cys 


Ala 


Ser 


Pro 


Thr 


Arg Gly Gly Asp Ala Gly Val Gin He Cys Leu 








100 




105 


110 


Ala 


Cys 


Arg 


Lys 


Arg 


Arg Lys Arg Cys 


Met Arg His Ala Met Cys Cys 






115 






120 


125 


Pro 


Gly 


Asn 


Tyr 


Cys 


Lys Asn Gly He Cys Val Ser Ser Asp Gin Asn 




130 








135 


140 
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His Phe 


Arg Gly 


Glu He Glu 


Glu Thr 


lie 


Thr 


Glu 


Ser 


Phe Gly Asn 


145 






150 






155 






160 


Asp His 


Ser 


Thr 


Leu Asp Gly 


Tyr Ser 


Arg 


Arg 


Thr 


Thr 


Leu Ser Ser 








165 




170 








175 


Lys Met 


Tyr 


His 


Thr Lys Gly 


Gin Glu 


Gly Ser 


Val 


Cys 


Leu Arg Ser 






180 




185 










190 


Ser Asp 


Cys 


Ala 


Ser Gly Leu 


Cys Cys 


Ala 


Arg 


His 


Phe 


Trp Ser Lys 




195 






200 








205 




lie Cys 


Lys 


Pro 


Val Leu Lys 


Glu Gly 


Gin 


Val 


Cys Thr Lys His Arg 


210 

• 






215 








220 






Arg Lys 


Gly 


Ser 


His Gly Leu 


Glu He 


Phe 


Gin 


Arg 


Cys 


Tyr Cys Gly 


225 






230 






235 






' 240 


Glu Gly 


Leu 


Ser 


Cys Arg He 


Gin Lys 


Asp 


His 


His 


Gin 


Ala Ser Asn 








245 




250 








255 


Ser Ser 


Arg 


Leu 


His Thr Cys ^Cln Arg 


His. 








• 






260 




265 













<210> 129 
<211> 39 
<212> DNA 

<213> Artificial Sequence 

* 

<220> 

<223> complementary synthetic oligonucleotide 
<400> 129 

tggtcagcgg cctggaggat gtggccgcag tggacttcc 

<210> 130 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 130 

ggaagtccac tgcggccaca tcctccaggc cgctgacca 

<210> 131 
<211> 40 
<212> DNA 

<213> Artificial Sequence 

* 

<220> 

<223> complementary synthetic oligonucleotide 
<400> 131 

aagctgtact ggacggactc agtgaccaac cgcatcgagg 40 

<210> 132 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 



39 



39 
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<400> 132 

cctcgatgcg gttggtcact gagtccgtcc agtacagctt 

<210> 133 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 133 

atgtactgga cagactggaa ggagacgccc cggattgagc g 

<210> 134 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 134 

cgctcaatcc ggggcgtctc cttccagtct gtccagtaca t 

<210> 135 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 135 

atgtactgga cagactggtt tgagacgccc cggattgagc g 

<210> 136 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 136 

cgctcaatcc ggggcgtctc aaaccagtct gtccagtaca t 

<210> 137 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 137 
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atgtactgga cagactggat tgagacgccc cggattgagc g 

<210> 138 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 138 

cgctcaatcc ggggcgtctc aatccagtct gtccagtaca t 

<210> 139 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 139 

atgtactgga cagactggca ggagacgccc cggattgagc g 

<210> 140 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 140 

cgctcaatcc ggggcgtctc ctgccagtct gtccagtaca t 

<210> 141 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 141 

cggacattta ctggcccaat gtactgacca tcgacctgga gg 

<210> 142 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 142 

cctccaggtc gatggtcagt acattgggcc agtaaatgtc eg 
<210> 143 
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<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 143 

agctctactg ggctgacgtc aagctcagct tcatccaccg 

<210> 144 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 144 

cggtggatga agctgagctt gacgtcagcc cagtagagct 

<210> 145 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 145 

gagtgccctc tactcacccg tggacatcca ggtgctgagc c 

<210> 146 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 146 

ggctcagcac ctggatgtcc acgggtgagt agagggcact c 
<210> 147 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 147 

catgtactgg acagactggg tagagaaccc taaaatcgag tgtgc 

<210> 148 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> complementary synthetic oligonucleotide 
<400> 148 

gcacactcga ttttagggtt ctctacccag tctgtccagt acatg 

<210> 149 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 149 

catctactgg accgagtggg tcggcaagcc gaggatcgtg eg 

<t210> 150 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 150 

cgcacgatcc tcggcttgcc gacccactcg gtccagtaga tg 

<210> 151 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 151 

gtacttcacc aacatggtgg accgggcagc caagatcgaa eg 

<210> 152 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 152 

cgttcgatct tggctgcccg gtccaccatg ttggtgaagt ac 

<210> 153 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> complementary synthetic oligonucleotide 
<400> 153 

gtactggaca gactgggtag aagtgccaaa gatagaacgt gc 

<210> 154 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> complementary synthetic oligonucleotide 
<400> 154 

gcacgttcta tctttggcac ttctacccag tctgtccagt ac 

<210> 155 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 155 

ttttttgtcg accaattcca acgctatcaa g 

<210> 156 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 156 

ttttttgtcg acctgcgcta gtcccacccg c 

<210> 157 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 157 

ttttttgtcg accgtgtctt ctgatcaaaa tc 

<210> 158 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



Page 177 



WO 02/092015 



032796-132. ST25 



<400> 158 

ttttttgtcg accggacaag aaggttctgt ttg 

<210> 159 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 159 

ttttttgcgg ccgcttattt ggtgtgatac atttttg 

<210> 160 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 160 

ttttttgcgg ccgcttagca agacagacct tctcc 

<210> 161 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 161 

ttttttgcgg ccgcttagtg tctctgacaa gtgtg 

<210> 162 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 162 

cagtgaattc accatgcaaa acaccacttt gttc 

<210> 163 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 163 

cagttgcggc cgctcatctc cggtggcctc tg 
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<210> 164 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> primer 
<400> 164 

caatagtcga cgaattcacc atggctctgg gcgcagcgg 39 

<210> 165 

<211> 42 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> primer 
<400> 165 

gtattgcggc cgctctagat tagtgtctct gacaagtgtg aa 42 

<210> 166 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> primer 
<400> 166 

ctaacggatc caccatggcc gcgttgatgc gg 32 

<210> 167 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> primer 
<400> 167 

gattcgaatt ctcaaatttt ctgacacaca tgg 33 

<210> 168 
<211> 690 
<212> DNA 
<213> Homo sapiens 

<400> 168 

aattccaacg ctatcaagaa cctgccccca ccgctgggcg gcgctgcggg gcacccaggc 60 
tctgcagtca gcgccgcgcc gggaatcctg tacccgggcg ggaataagta ccagaccatt 120 
gacaactacc agccgtaccc gtgcgcagag gacgaggagt gcggcactga tgagtactgc 180 
gctagtccca cccgcggagg ggacgcgggc gtgcaaatct gtctcgcctg caggaagcgc 240 
cgaaaacgct gcatgcgtca cgctatgtgc tgccccggga attactgcaa aaatggaata 300 
tgtgtgtctt ctgatcaaaa tcatttccga ggagaaattg aggaaaccat cactgaaagc 360 
tttggtaatg atcatagcac cttggatggg tattccagaa gaaccacctt gtcttcaaaa 420 
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atgtatcaca ccaaaggaca agaaggttct 
ggattgtgtt gtgctagaca cttctggtcc 
caagtgtgta ccaagcatag gagaaaaggc 
tactgtggag aaggtctgtc ttgccggata 
tctaggcttc acacttgtca gagacactaa 

<210> 169 

<211> 1226 

<212> DNA 

<213> Homo sapiens 



gtttgtctcc ggtcatcaga ctgtgcctca 480 
aagatctgta aacctgtcct gaaagaaggt 540 
tctcatggac tagaaatatt ccagcgttgt 600 
cagaaagatc accatcaagc cagtaattct 660 

690 



<400> 169 

ctcatcctgc ccctgcatgg actgaggaac gtcaaagcca tcgactatga cccactggac 60 

aagttcatct actgggtgga tgggcgccag aacatcaagc gagccaagga cgacgggacc 120 

cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca gccccacgac 180 

ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac caataccatc 240 

aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga ccgcgacaag 300 

cccagggcca tcgtcgtcaa cgcggagcga gggtacctgt acttcaccaa catgcaggac 360 

cgggcagcca agatcgaacg cgcagccctg gacggcaccg agcgcgaggt cctcttcacc 420 

accggcctca tccgccctgt ggccctggtg gtagacaaca cactgggcaa gctgttctgg 480 

gtggacgcgg acctgaagcg cattgagagc tgtgacctgt caggggccaa ccgcctgacc 540 

ctggaggacg ccaacatcgt gcagcctctg ggcctgacca tccttggcaa gcatctctac 600 

tggatcgacc gccagcagca gatgatcgag cgtgtggaga agaccaccgg ggacaagcgg 660 

actcgcatcc agggccgtgt cgcccacctc actggcatcc atgcagtgga ggaagtcagc 720 

ctggaggagt tctcagccca cccatgtgcc cgtgacaatg gtggctgctc ccacatctgt 780 

attgccaagg gtgatgggac accacggtgc tcatgcccag tccacctcgt gctcctgcag 840 

aacctgctga cctgtggaga gccgcccacc tgctccccgg accagtttgc atgtgccaca 900 

ggggagatcg actgtatccc cggggcctgg cgctgtgacg gctttcccga gtgcgatgac 960 

cagagcgacg aggagggctg ccccgtgtgc tccgccgccc agttcccctg cgcgcggggt 1020 

cagtgtggtg gacctgcgcc tgcgctgcga cggcgaggca gactgtcagg accgctcaga 1080 

cgaggcggac tgtgacgcca tctgcctgcc caaccagttc cggtgtgcga gcggccagtg 1140 

tgtcctcatc aaacagccag tgcgactcct tccccgactg tatcgacggc tccgacgagc 1200 

tcatgtgtga aatcaccaag ccgccc 1226 

<210> 170 
<211> 934 
<212> DNA 
<213> Homo sapiens 



<400> 170 

agggccatcg tcgtcaacgc ggagcgaggg tacctgtact tcaccaacat gcaggaccgg 60 

gcagccaaga tcgaacgcgc agccctggac ggcaccgagc gcgaggtcct cttcaccacc 120 

ggcctcatcc gccctgtggc cctggtggta gacaacacac tgggcaagct gttctgggtg 180 

gacgcggacc tgaagcgcat tgagagctgt gacctgtcag gggccaaccg cctgaccctg 240 

gaggacgcca acatcgtgca gcctctgggc ctgaccatcc ttggcaagca tctctactgg 300 

atcgaccgcc agcagcagat gatcgagcgt gtggagaaga ccaccgggga caagcggact 360 

cgcatccagg gccgtgtcgc ccacctcact ggcatccatg cagtggagga agtcagcctg 420 

gaggagttct cagcccaccc atgtgcccgt gacaatggtg gctgctccca catctgtatt 480 

gccaagggtg atgggacacc acggtgctca tgcccagtcc acctcgtgct cctgcagaac 540 

ctgctgacct gtggagagcc gcccacctgc tccccggacc agtttgcatg tgccacaggg 600 

gagatcgact gtatccccgg ggcctggcgc tgtgacggct ttcccgagtg cgatgaccag 660 

agcgacgagg agggctgccc cgtgtgcctc cgccgcccag ttcccctgcg cgcggggtca 720 

gtgtgtggac ctgcgcctgc gctgcgacgg cgaggcagac tgtcaggacc gctcagacga 780 

ggcggactgt gacgccatct ggcctgccca accagttccg gtgtgcgagc ggccagtgtg 840 

tcctcatcaa acagcagtgc gactccttcc ccgactgtat cgacggctcc gacgagctca 900 

tgtgtgaaat caccaagccg ccctaagcgg ccgc 934 
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<210> 171 

<211> 16 

<212> PRT. 

<213> Homo sapiens 

<400> 171 

Ser Val Gly Cys Leu Leu Cys Ala Gly Leu Gly Val Trp Ser Leu Ser 
1 5 10 15 



<210> 172 

<211> 19 

<212> PRT 

<213> Homo sapiens 

» 

<400> 172 

Trp Cys Cys Cys Gly Leu Phe Arg Gly Val Cys Val Trp Ser Cys Gly 

15 10 15 

Ala Asp Asp 



<210> 173 

<211> 16 

<212> PRT 

<213> Homo sapiens 

m 

<400> 173 

Gly Trp Arg Arg Cys Asp Trp Cys Gly Cys Val Ser Trp Cys Trp Val 
1 5 10 * 15 



<210> 174 

<211> 32 

<212> PRT 

<213> Homo sapiens 

<400> 174 

Met Pro Gly Ser Val Ser His Cys Trp Gly Gly He Cys Glu Ala Leu 

15 10 15 

Ser Cys Cys Ala Val Asp Val Cys Leu Arg Cys Gly Gly Trp Phe Arg 

20 25 30 



<210> 175 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 175 

Ser Cys Cys Ala Val Asp Val Cys Leu Arg Cys Gly Gly Trp Phe Arg 
1 5 10 15 



<210> 176 
<211> 16 
<212> PRT 



Page 181 



WO 02/092015 



PCT/US02/15982 



032796-132. ST25 



<213> Homo sapiens 
<400> 176 

Ser Val Leu Gly Thr Cys Cys Cys Cys Gly Gly Trp lie Leu Cys Glu 
15 10 15 

<210> 177 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 177 

Val Leu Ser Val Cys Glu Val Cys Gly Gly Val Phe Val Arg Arg Cys 
1 5 10 15 



<210> 178 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 178 

Gly Met Trp Tyr Trp Ser Gly Arg Asp Cys Ala Leu Cys Trp Leu 
1 5 10 15 

<210>179 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 179 

Cys Thr Ala Val Met Trp Gly Val Gly Ser Val Ala Tyr Leu Gly Glu 
1 5 10 15 



<210> 180 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 180 

Val Val Cys Trp Trp Cys Gly Cys Arg Gly Trp Trp Arg 
15 10 



<210> 181 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 181 

Cys Val Cys Ala Ser Phe Cys Cys Cys Val Cys Gly Leu Arg Leu Leu 
15 10 15 
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<210> 182 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 182 

Thr Tyr Glu Val Cys Glu Glu Cys Gly Gly Arg Val Arg Met Trp Val 
1 5 10 15 



<210> 183 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 183 

Val Val Val Cys Ala Ser Cys Gly Gin Val Trp His Gly Ser Gly Ala 
1 5 10 15 



<210> 184 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 184 

Cys Cys Arg Cys Cys His Cys Trp Asp Cys Glu Trp His Met Cys Val 
1 5 10 15 



<210> 185 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 185 

Phe Cys Ala Ser Cys Cys Trp Cys Gly Cys Asp Cys Phe Gly Trp Val 
1 5 10 ~ 15 



<210> 186 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 186 

Cys Asp Tyr Cys Trp Ser Cys Gly Val Trp Cys Pro Ser Ser Trp Leu 
15 10 15 



<210> 187 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 187 

Val Tyr Leu Cys Val Trp Cys Gly Ala Ala Arg Phe Gly Cys Tyr Gly 
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10 15 



<210> 188 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<40Q> 188 



Phe Cys Val Cys Gly Cys Cys Trp Cys Trp Cys Ala Ala Cys Trp Cys 
15 10 15 



<210> 189 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 189 

Val Val Leu Cys Ser Arg Cys Gly Arg Leu Trp Arg Trp Ser Cys Gly 
15 10 15 



<210> 190 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 190 

Glu Val Arg Gin Val Thr Cys He Arg Cys Arg Arg Gly Phe Leu Leu 
1 5 10 15 



<210> 191 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 191 

Gly Gly Gly Gly Met Trp Glu Ala Trp Ser Cys Tyr Ala Cys Gly 
1 5 10 15 



<210> 192 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 192 

Gly Trp Arg Trp Cys Gly Arg Cys Gly Ala Leu Trp Trp Arg Arg Val 
1 5 10 is 



<210> 193 
<211> 485 
<212> DNA 

<213> Homo sapiens 
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<400> 193 

aagcttgcca ccatggagac agacacactc 
ggttccactg gtgacggatc catgagcgat 
gacacggatg tactcaaagc ggacggggcg 
ggtccgaatt ccgtggttct gtgttcgcgt 
actagtggtc cgtgcaaaat gatcgccccg 
ggcaaactga ccgttgcaaa actgaacatc 
ggcatccgtg gtatcccgac tctgctgctg 
gtgggtgcac tgtctaaagg tcagttgaaa 
gccgc 

<210> 194 

<211> 485 

<212> DNA 

<213> Homo sapiens 

<400> 194 

aagcttgcca ccatggagac agacacactc 

ggttccactg gtgacggatc catgagcgat 

gacacggatg tactcaaagc ggacggggcg 

ggtccgaatt ccgggtggcg gtggtgtggt 

actagtggtc cgtgcaaaat gatcgccccg 

ggcaaactga ccgttgcaaa actgaacatc 

ggcatccgtg gtatcccgac tctgctgctg 

gtgggtgcac tgtctaaagg tcagttgaaa 
gccgc 

<210> 195 
<211> 485 
<212> DNA 
<213> Homo sapiens 

<400> 195 

aagcttgcca ccatggagac agacacactc 

ggttccactg gtgacggatc catgagcgat 

gacacggatg tactcaaagc ggacggggcg 

ggtccgaatt ccgaggtgcg gcaggttacg 

actagtggtc cgtgcaaaat gatcgccccg 

ggcaaactga ccgttgcaaa actgaacatc 

ggcatccgtg gtatcccgac tctgctgctg 

gtgggtgcac tgtctaaagg tcagttgaaa 
gccgc 

<210> 196 
<211> 485 
<212> DNA 
<213> Homo sapiens 

<400> 196 

aagcttgcca ccatggagac agacacactc 
ggttccactg gtgacggatc catgagcgat 
gacacggatg tactcaaagc ggacggggcg 
ggtccgaatt ccggtggtgg ggggatgatt 
actagtggtc cgtgcaaaat gatcgccccg 
ggcaaactga ccgttgcaaa actgaacatc 
ggcatccgtg gtatcccgac tctgctgctg 



ctgctatggg tactgctgct ctgggttcca 60 
aaaattattc acctgactga cgacagtttt 120 
atcctcgtcg atttctgggc agagtggtgc 180 
tgtgggcgtt tgtggcggtg gtcgtgtggg 240 
attctggatg aaatcgctga cgaatatcag 300 
gatcaaaacc ctggcactgc gccgaaatat 360 
ttcaaaaacg gtgaagtggc ggcaaccaaa 420 
gagttcctcg acgctaacct ggcgtaagcg 480 

485 



ctgctatggg tactgctgct ctgggttcca 60 
aaaattattc acctgactga cgacagtttt 120 
atcctcgtcg atttctgggc agagtggtgc 180 
cggtgtgggg ctttgtggtg gcggcgtgtt 240 
attctggatg aaatcgctga cgaatatcag 300 
gatcaaaacc ctggcactgc gccgaaatat 360 
ttcaaaaacg gtgaagtggc ggcaaccaaa 420 
gagttcctcg acgctaacct ggcgtaagcg 480 

485 



ctgctatggg tactgctgct ctgggttcca 60 
aaaattattc acctgactga cgacagtttt 120 
atcctcgtcg atttctgggc agagtggtgc 180 
tgtattaggt gtcgtcgggg ttttctgttg 240 
attctggatg aaatcgctga cgaatatcag 300 
gatcaaaacc ctggcactgc gccgaaatat 360 
ttcaaaaacg gtgaagtggc ggcaaccaaa 420 
gagttcctcg acgctaacct ggcgtaagcg 480 

485 



ctgctatggg tactgctgct ctgggttcca 60 
aaaattattc acctgactga cgacagtttt 120 
atcctcgtcg atttctgggc agagtggtgc 180 
tgggaggctt ggagttgtta tgcgtgtggg 240 
attctggatg aaatcgctga cgaatatcag 300 
gatcaaaacc ctggcactgc gccgaaatat 360 
ttcaaaaacg gtgaagtggc ggcaaccaaa 420 
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gtgggtgcac tgtctaaagg tcagttgaaa gagttcctcg acgctaacct 
gccgc 

<210> 197 
<211> 485 
<212> DNA 
<213> Homo sapiens 

<400> 197 

aagcttgcca ccatggagac agacacactc ctgctatggg tactgctgct ctgggttcca 60 
ggttccactg gtgacggatc catgagcgat aaaattattc acctgactga cgacagtttt 120 
gacacggatg tactcaaagc ggacggggcg atcctcgtcg atttctgggc agagtggtgc 180 
ggtccgaatt ccttgtggat tgggccgggt gatcagggtc tgtttcggcg ttttgttttt 240 
actagtggtc cgtgcaaaat gatcgccccg attctggatg aaatcgctga cgaatatcag 300 
ggcaaactga ccgttgcaaa actgaacatc gatcaaaacc ctggcactgc gccgaaatat 360 
ggcatccgtg gtatcccgac tctgctgctg ttcaaaaacg gtgaagtggc ggcaaccaaa 420 
gtgggtgcac tgtctaaagg tcagttgaaa gagttcctcg acgctaacct ggcgtaagcg 480 
gccgc 485 

<210> 198 
<211> 476 
<212> DNA 

<213> Homo sapiens 



ggcgtaagcg 480 

485 



<400> 198 

aagcttgcca ccatggagac agacacactc ctgctatggg tactgctgct ctgggttcca 60 

ggttccactg gtgacggatc cgtgtcttct gatcaaaatc atttccgagg agaaattgag 120 

gaaaccatca ctgaaagctt tggtaatgat catagcacct tggatgggta ttccagaaga 180 

accaccttgt cttcaaaaat gtatcacacc aaaggacaag aaggttctgt ttgtctccgg 240 

tcatcagact gtgcctcagg attgtgttgt gctagacact tctggtccaa gatctgtaaa 300 

cctgtcctga aagaaggtca agtgtgtacc aagcatagga gaaaaggctc tcatggacta 360 

gaaatattcc agcgttgtta ctgtggagaa ggtctgtctt gccggataca gaaagatcac 420 

catcaagcca gtaattcttc taggcttcac acttgtcaga gacactaagc ggccgc 476 

<210> 199 
<211> 539 
<212> DNA 

<213> Homo sapiens 
<400> 199 

aagcttgcca ccatggagac agacacactc ctgctatggg tactgctgct ctgggttcca 60 
ggttccactg gtgacggatc ctgcgctagt cccacccgcg gaggggacgc gggcgtgcaa 120 
atctgtctcg cctgcaggaa gcgccgaaaa cgctgcatgc gtcacgctat gtgctgcccc 180 
gggaattact gcaaaaatgg aatatgtgtg tcttctgatc aaaatcattt ccgaggagaa 240 
attgaggaaa ccatcactga aagctttggt aatgatcata gcaccttgga tgggtattcc 300 
agaagaacca ccttgtcttc aaaaatgtat cacaccaaag gacaagaagg ttctgtttgt 360 
ctccggtcat cagactgtgc ctcaggattg tgttgtgcta gacacttctg gtccaagatc 420 
tgtaaacctg tcctgaaaga aggtcaagtg tgtaccaagc ataggagaaa aggctctcat 480 
ggactagaaa tattccagcg ttgttactgt ggagaaggtc tgtcttgcta agcggccgc 539 

<210> 200 
<211> 416 
<212> DNA 

<213> Homo sapiens 
<400> 200 

aagcttgcca ccatgggcga taaaattatt cacctgactg acgacagttt tgacacggat 60 
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gtactcaaag cggacggggc gatcctcgtc gatttctggg cagagtggtg cggtccgaat 120 

tcctatgcgt ggttgttttc ttgtagtagg tgtaggtggt ggttgccttg gactagtggt 180 

ccgtgcaaaa tgatcgcccc gattctggat gaaatcgctg acgaatatca gggcaaactg 240 

accgttgcaa aactgaacat cgatcaaaac cctggcactg cgccgaaata tggcatccgt 300 

ggtatcccga ctctgctgct gttcaaaaac ggtgaagtgg cggcaaccaa agtgggtgca 360 

ctgtctaaag gtcagttgaa agagttcctc gacgctaacc tggcgtaagc ggccgc 416 

<210> 201 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400> 201 

aagcttgcca ccatgggcga taaaattatt cacctgactg acgacagttt tgacacggat 60 
gtactcaaac gccacggggc gatcctcgtc gatttctggg cagagtggtg cggtccgaat 120 
tccatttgtg aggttgtgag gttgtggagt cggtatcctt ggtcttgggt gactagtggt 180 
ccgtgcaaaa tgatcgcccc gattctggat gaaatcgctg acgaatatca gggcaaactg 240 
accgttgcaa aactgaacat cgatcaaaac cctggcactg cgccgaaata tggcatccgt 300 
ggtatcccga ctctgctgct gttcaaaaac ggtgaagtgg cggcaaccaa agtgggtgca 360 
ctgtctaaag gtcagttgaa agagttcctc gacgctaacc tggcgtaagc ggccgc 416 

<210> 202 
<211> 422 
<212> DNA 
<213> Homo sapiens 

<400> 202 

aagcttgcca ccatgggcga taaaattatt cacctgactg acgacagttt tgacacggat 60 
gtactcaaag cggacggggc gatcctcgtc gatttctggg cagagtggtg cggtccgaat 120 
tccggttgta ctagtgcggt gtgtggtgct tgggctgagg cgggtaggtt ttattgtact 180 
agtggtccgt gcaaaatgat cgccccgatt ctggatgaaa tcgctgacga atatcagggc 240 
aaactgaccg ttgcaaaact gaacatcgat caaaaccctg gcactgcgcc gaaatatggc 300 
atccgtggta tcccgactct gctgctgttc aaaaacggtg aagtggcggc aaccaaagtg 360 
ggtgcactgt ctaaaggtca gttgaaagag ttcctcgacg ctaacctggc gtaagcggcc 420 
90 422 

<210> 203 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400> 203 

aagcttgcca ccatgggcga taaaattatt cacctgactg acgacagttt tgacacggat 60 
gtactcaaag cggacggggc gatcctcgtc gatttctggg cagagtggtg cggtccgaat 120 
tccttgtgga ttgggccggg tgatcagggt ctgtttcggc gttttgtttt tactagtggt 180 
ccgtgcaaaa tgatcgcccc gattctggat gaaatcgctg acgaatatca gggcaaactg 240 
accgttgcaa aactgaacat cgatcaaaac cctggcactg cgccgaaata tggcatccgt 300 
ggtatcccga ctctgctgct gttcaaaaac ggtgaagtgg cggcaaccaa agtgggtgca 360 
ctgtctaaag gtcagttgaa agagttcctc gacgctaacc tggcgtaagc ggccgc 416 



<210> 204 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<400> 204 
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Met Glu 
1 



Thr Asp Thr 
5 



Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 




Gly Ser Thr Gly Asp Gly Ser Met Ser Asp Lys He He His Leu Thr 

20 25 30 

Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala He Leu 

35 40 45 

Val Asp Phe Trp Ala Glu Trp Cys Gly Pro Asn Ser Val Val Leu Cys 

50 55 60 

Ser Arg Cys Gly Arg Leu Trp Arg Trp Ser Cys Gly Thr Ser Gly Pro 
65 70 75 80 

Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp Glu Tyr Gin 

85 90 95 

Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn Pro Gly Thr 

100 105 110 

Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu Leu Phe Lys 

115 120 125 

Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gin 

130 135 140 

Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala 
145 150 

<210> 205 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<400> 205 

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 

15 10 15 

Gly Ser Thr Gly Asp Gly Ser Met Ser Asp Lys He lie His Leu Thr 

20 25 30 

Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala He Leu 

35 40 45 

Val Asp Phe Trp Ala Glu Trp Cys Gly Pro Asn Ser Gly Trp Arg Trp 

50 55 60 

Cys Gly Arg Cys Gly Ala Leu Trp Trp Arg Arg Val Thr Ser Gly Pro 
65 70 75 80 

Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp Glu Tyr Gin 

85 90 95 

Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn Pro Gly Thr 

100 105 110 

Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu Leu Phe Lys 

115 120 125 

Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gin 

130 135 140 

Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala 
145 150 



<210> 206 
<211> 154 
<212> PRT 
<213> Homo sapiens 

<400> 206 

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
15 10 15 
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Car 


rnr 


Gly Asp Gly Ser Met 










ASp 


ASp 


Ser 


Phe Asp Thr Asp Val 








40 


Val 


Asp 


Phe 


Trp Ala Glu Trp Cys 








55 


Val 


Thr 


Cys 


lie Arg Cys Arg Arg 


65 






70 


Cys 


Lys 


Met 


^P ^ ^ ^ _ ~_ - - m — 

lie Ala Pro lie Leu 








85 


Gly 


Lys 


Leu 


Thr Val Ala Lys Leu 








100 


Ala 


Pro 


Lys 


Tyr Gly lie Arg Gly 






115 


120 


Asn 


Gly 


Glu 


Val Ala Ala Thr Lys 




130 




135 


Leu 


Lys 


Glu 


Phe Leu Asp Ala Asn 


145 






150 



Ser Asp Lys He He His Leu Thr 
25 30 
Leu Lys Ala Asp Gly Ala He Leu 

45 

Gly Pro Asn Ser Glu Val Arg Gin 

60 

Gly Phe Leu Leu Thr Ser Gly Pro 

75 80 
Asp Glu He Ala Asp Glu Tyr Gin 

90 95 
Asn He Asp Gin Asn Pro Gly Thr 
105 110 
He Pro Thr Leu Leu Leu Phe Lys 

125 

Val Gly Ala Leu Ser Lys Gly Gin 

140 

Leu Ala 



<210> 207 






<211> 154 






<212> PRT 






<213> Homo sapiens 




<400> 207 






Met Glu 


Thr Asp Thr Leu Leu Leu 


1 




5 




Gly Ser 


Thr 


Gly Asp Gly Ser Met 






20 




Asp Asp 


Ser 


Phe Asp 


Thr Asp Val 




35 




40 


Val Asp 


Phe 


Trp Ala 


Glu Trp Cys 


50 






55 


Met He 


Trp 


Glu Ala 


Trp Ser Cys 


65 






70 


Cys Lys 


Met 


He Ala 


Pro He Levi 






85 




Gly Lys 


Leu 


Thr Val 


Ala Lys Leu 






100 




Ala Pro 


Lys 


Tyr Gly He Arg Gly 




115 




120 


Asn Gly 


Glu 


Val Ala 


Ala Thr Lys 


130 






135 


Leu Lys 


Glu 


Phe Leu 


Asp Ala Asn 


145 






150 



Trp Val Leu Leu Leu Trp Val Pro 

10 15 
Ser Asp Lys He He His Leu Thr 
25 30 
Leu Lys Ala Asp Gly Ala He Leu 

45 

Gly Pro Asn Ser Gly Gly Gly Gly 

60 

Tyr Ala Cys Gly Thr Ser Gly Pro 

75 80 
Asp Glu He Ala Asp Glu Tyr Gin 

90 95 
Asn He Asp Gin Asn Pro Gly Thr 
105 110 
He Pro Thr Leu Leu Leu Phe Lys 

125 

Val Gly Ala Leu Ser Lys Gly Gin 

140 

Leu Ala 



<210> 208 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<400> 208 

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 

15 10 15 

Gly Ser Thr Gly Asp Gly Ser Met Ser Asp Lys He He His Leu Thr 
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20 


25 








30 


Asd 


Asd 


Ser 


Phe Asd Thr 


Asn Val T.pn 


T \to 
LtyS 


Ala 


Asp Gly Ala He Leu 






35 




40 








A C 

45 


Val 


Asn 


Phe 


Trn Ala ft 1 n 
lip rvx a ox u 


ft\/c ft 1 
iip v*yo uiy 


D >~ f\ 

fro 


7\ «fN 


Ser 


Leu Trp lie Gly 




50 












DU 




Glv 


nap 


ftl r» ^1 w T ah 
Uiil uiy LcU 


rue Arg Hrg 


rne 


vai 


Pne 


Thr Ser Gly Pro 


65 






70 






75 




80 




T.vc 
IJjf o 


IJcL 


XJLe ma rXO 


xie lieu Asp 


Glu 


He 


Ala 


Asp Glu Tyr Gin 












90 






95 




«LyS 


Leu 


Tnr val Ala 


Lys Leu Asn 


He 


Asp 


Gin 


Asn Pro Gly Thr 








100 


105 








110 


Ala 


Pro 


Lys 


Tyr Gly lie 


Arg Gly lie 


Pro 


Thr 


Leu 


Leu Leu Phe Lys 






115 




120 








125 


Asn 


Gly 


Glu 


Val Ala Ala 


Thr Lys Val 


Gly Ala 


Leu 


Ser Lys Gly Gin 




130 






135 






140 


Leu 


Lys 


Glu 


Phe Leu Asp 


Ala Asn Leu 


Ala 








145 






150 













<210> 209 
<211> 151 
<212> PRT 

<213> Homo sapiens 



<400> 209 



Met 


Glu 


Thr Asp 


Thr Leu 


Leu Leu Trp 


Val Leu Leu Leu Trp Val 


Pro 


1 






5 




10 15 




Gly 


Ser 


Thr Gly 


Asp Gly 


Ser Val Ser 


Ser Asp Gin Asn His Phe 


Arg 






20 




25 


30 


Gly 


Glu 


He Glu 


Glu Thr 


He Thr Glu 


Ser Phe Gly Asn Asp His 


Ser 






35 




40 


45 




Thr 


Leu 


Asp Gly 


Tyr Ser 


Arg Arg Thr 


Thr Leu Ser Ser Lys Met 


Tyr 




50 






55 


60 


His 


Thr 


Lys Gly 


Gin Glu 


Gly Ser Val 


Cys Leu Arg Ser Ser Asp Cys 


65 






70 




75 


80 


Ala 


Ser 


Gly Leu 


Cys Cys 


Ala Arg His 


Phe Trp Ser Lys He Cys 


Lys 








85 




90 95 


Pro 


Val 


Leu Lys 


Glu Gly 


Gin Val Cys 


Thr Lys His Arg Arg Lys 


Gly 






100 




105 


110 


Ser 


His 


Gly Leu 


Glu lie 


Phe Gin Arg 


Cys Tyr Cys Gly Glu Gly Leu 






115 




120 


125 




Ser 


Cys 


Arg He 


Gin Lys 


Asp His His 


Gin Ala Ser Asn Ser Ser 


Arg 




130 






135 


140 


Leu 


His 


Thr Cys 


Gin Arg 


His 






145 






150 









<210> 210 
<211> 172 
<212> PRT 
<213> Homo sapiens 



<400> 210 

Met Glu Thr Asp Thr Leu 

1 5 
Gly Ser Thr Gly Asp Gly 

20 

Ala Gly Val Gin He Cys 



Leu Leu Trp Val Leu 

10 

Ser Cys Ala Ser Pro 
25 

Leu Ala Cys Arg Lys 



Leu Leu Trp Val Pro 

15 

Thr Arg Gly Gly Asp 
30 

Arg Arg Lys Arg Cys 
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35 








40 






45 


Met 


Arg 


His 


Ala 


Met 


Cys 


Cvs Pro 


Gly Asn Tyr 


Cys 


T.\/c Den f 1 \r T 1 a 

JjyS A5n \j±y 116 




50 










55 




60 


Cvs 


Val 


Ser Ser Asp Gin 


Asn His 

null ii J- <J 


Phe Arg Gly Glu 


lie vjiu (jiu i nr 


65 










70 




75 




80 


lie 


Thr 


Glu 


Ser 


Phe 


Gly 




His Ser Thr 


Leu 


/\sp vaiy iyr oer 










85 






90 






Ara 


Aro 


Thr 


Thr 


Leu 


Ser 




Met Tyr His 


Thr 


iiys kaiy bin tsiu 








100 








105 




1 1 n 

1 1 u 


Gly 


Ser 


Val 


Cys 


Leu 


Arg 


Ser Ser 


Asp Cys Ala 


Ser 


Gly Leu Cys Cys 






115 








120 






125 


Ala 


Arg 


His 


Phe 


Trp 


Ser 


Lys lie 


Cys Lys Pro 


Val 


Leu Lys Glu Gly 




130 










135 




140 


Gin 


Val 


Cys 


Thr 


Lys 


His 


Arg Arg 


Lys Gly Ser 


His 


Gly Leu Glu He 


145 










150 




155 




160 


Phe 


Gin 


Arg Cys 


Tyr Cys 


Gly Glu 


Gly Leu Ser Cys 





165 170 



<210> 211 
<211> 131 
<212> PRT 

<213> Homo sapiens 
<400> 211 ■ 



Met Gly 


Asp 


Lys 


He He 


His 


Leu 


Thr Asp Asp Ser 


Phe Asp Thr Asp 


1 






5 






10 


15 


Val Leu 


Lys 


Ala 


Asp Gly 


Ala 


He 


Leu Val Asp Phe 


Trp Ala Glu Trp 






20 








25 


30 


Cys Gly 


Pro 


Asn 


Ser Tyr 


Ala 

• 


Trp 


Leu Phe Ser Cys 


Ser Arg Cys Arg 


40 






45 








Trp Trp 


Leu 


Pro 


Trp Thr 


Ser 


Gly 


Pro Cys Lys Met 


He Ala Pro He 


50 








55 




60 




Leu Asp 


Glu 


He 


Ala Asp 


Glu 


Tyr 


Gin Gly Lys Leu 


Thr Val Ala Lys 


65 






70 






75 


80 


Leu Asn 


He 


Asp 


Gin Asn 


Pro Gly 


Thr Ala Pro Lys 


Tyr Gly He Arg 








85 






90 


95 


Gly He 


Pro 


Thr 


Leu Leu 


Leu 


Phe 


Lys Asn Gly Glu Val Ala Ala Thr 






100 








105 


110 


Lys Val 


Gly 


Ala 


Leu Ser 


Lys 


Gly 


Gin Leu Lys Glu 


Phe Leu Asp Ala 




115 








120 




125 


Asn Leu 


Ala 










♦ 





130 



<210> 212 
<211> 131 
<212> PRT 

<213> Homo sapiens 
<400> 212 

Met Gly Asp Lys He He His Leu Thr Asp Asp Ser Phe Asp Thr Asp 

15 10 15 

Val Leu Lys Arg His Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp 

20 25 30 

Cys Gly Pro Asn Ser He Cys Glu Val Val Arg Leu Trp Ser Arg Tyr 

35 40 45 

Pro Trp Ser Trp Val Thr Ser Gly Pro Cys Lys Met He Ala Pro He 
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50 

Leu Asp Glu He 
65 

Leu Asn He Asp 

Gly He Pro Thr 

100 

Lys Val Gly Ala 
115 

Asn Leu Ala 
130 



55 

Ala Asp Glu Tyr 
70 

Gin Asn Pro Gly 
85 

Leu Leu Leu Phe 

Leu Ser Lys Gly 

120 



60 

Gin Gly Lys Leu 
75 

Thr Ala Pro Lys 
90 

Lys Asn Gly Glu 
105 

Gin Leu Lys Glu 



Thr Val Ala Lys 

80 

Tyr Gly He Arg 
95 

Val Ala Ala Thr 
110 

Phe Leu Asp Ala 
125 



<210> 213 
<211> 133 
<212> PRT 

<213> Homo sapiens 



<400> 213 



Met 


Gly Asp 


Lys 


He 


1 








5 


Val 


Leu 


Lys 


Ala 
20 


Asp 


Cys 


Gly 


Pro 
35 


Asn 


Ser 


Glu 


Ala Gly 


Arg 


Phe 




50 








Pro 


He 


Leu 


Asp 


Glu 


65 










Ala 


Lys 


Leu 


Asn 


He 
85 


He 


Arg 


Gly 


He 
100 


Pro 


Ala 


Thr 


Lys 
115 


Val 


Gly 


Asp 


Ala 
130 


Asn 


Leu 


Ala 



He His Leu Thr Asp Asp 

10 

Gly Ala He Leu Val Asp 

25 

Gly Cys Thr Ser Ala Val 
40 

Tyr Cys Thr Ser Gly Pro 
55 

He Ala Asp Glu Tyr Gin 
70 75 
Asp Gin Asn Pro Gly Thr 

90 

Thr Leu Leu Leu Phe Lys 

105 

Ala Leu Ser Lys Gly Gin 
120 



Ser Phe Asp Thr Asp 

15 

Phe Trp Ala Glu Trp 
30 

Cys Gly Ala Trp Ala 
45 

Cys Lys Met lie Ala 
60 

Gly Lys Leu Thr Val 

80 

Ala Pro Lys Tyr Gly 

95 

Asn Gly Glu Val Ala 
110 

Leu Lys Glu Phe Leu 
125 



<210> 214 
<211> 131 
<212> PRT 

<213> Homo sapiens 
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Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 
This international report has not been established in respect of certain claims under Article 17(2) (a) for the following reasons: 

1. \^\ Claim Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



I — I 

2. I | Claim Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 



3- □ 
6.4(a). 



Claim Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 



Box n Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Continuation Sheet 



1. 



2. 



3. 



□ 
□ 
□ 



As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos.: 



4. 



No required additional search fees were timely paid by the applicant. Consequently, this international search report 
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 1- 1 i and 13-23 



Remark on Protest 



The additional search fees were accompanied by the applicant's protest. 
No protest accompanied the payment of additional search fees. 
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BOX H. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single general 
inventive concept under PCT Rule 13.1. In order for all inventions to be examined, the appropriate additional examination fees must 
be paid. 

I. Claims 1-11 and 13-23, drawn to a method of regulating LRP5 activity in a subject, comprising administering a composition which 
inhibits Dkk binding to LRP5. 

II. Claims 1-1 1 and 13-20, drawn to a method of regulating LRP5 activity in a subject, comprising administering a composition which 
inhibits Dkk binding to LRP6. 







m 


MM 



which inhibits Dkk binding to HBM. 

IV. Claims 1- 10, 12- 14 and 16-23 drawn to a method of regulating LRP5 activity in a subject, comprising administering a 
composition which enhances Dkk binding to LRP5. 

V. Claims 1-10, 12-14 and 16-20, drawn to a method of regulating LRP5 activity in a subject, comprising administering a 
composition which enhances Dkk binding to LRP6. 

VI. Claims 1-10, 12-14 and 16-20, drawn to a method of regulating LRP5 activity in a subject, comprising administering a 
composition which enhances Dkk binding to HBM. 

VII. Claims 2 - 4, 7- 1 1, 13-20, 21-27, drawn to a method of regulating Dkk-Wnt pathway activity, comprising administering a 
composition which modulates Dkk activity, or interaction with LRP5, wherein said composition inhibits Dkk binding to LRP5. 

VIII. Claims 2 - 4, 7-1 1, 13-20, 24-27, drawn to a method of regulating Dkk- Wnt pathway activity, comprising administering a 
composition which modulates Dkk activity, or interaction with LRP6, wherein said composition inhibits Dkk binding to LRP6. 

IX. Claims 2 - 4, 7- 1 1 , 13-20, 24-27, drawn to a method of regulating Dkk-Wnt pathway activity, comprising administering a 
composition which modulates Dkk activity, or interaction with HBM, wherein said composition inhibits Dkk binding to HBM. 

X. Claims 2 - 4, 7-10, 12-27, drawn to a method of regulating Dkk-Wnt pathway activity, comprising administering a composition 
which modulates Dkk activity, or interaction with RP5, wherein said composition enhances Dkk binding to LRP5. 

XI. Claims 2-4, 7-10, 12-20, 24-27, drawn to a method of regulating Dkk-Wnt pathway activity, comprising administering a 
composition which modulates Dkk activity, or interaction with LRP6, wherein said composition enhances Dkk binding to LRP6. . 

XII. Claims 2 - 4, 7-10, 12-20, 24-27, drawn to a method of regulating Dkk-Wnt pathway activity, comprising administering a 
composition which modulates Dkk activity, or interaction with HBM, wherein said composition enhances Dkk binding to HBM. 

X3II. Claims 2-4, 7-11, 13-23, 28-32 drawn to a method of modulating bone mass in a subject comprising ad^ninistering to the 
subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition inhibits Dldc binding to 
LRP5. 

XIV. Claims 2 - 4, 7- 1 1 , 13-23, 28-32 drawn to a method of modulating bone mass in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition inhibits Dldc binding to 
LRP6. 

XV. Claims 2-4,7-11, 13-23, 28-32 drawn to a method of modulating bone mass in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with HBM, wherein said composition inhibits Dkk bmding to 
HBM. 
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XVI. Claims 2 - 4, 7-10, 12-23, 28-32 drawn to a method of modulating bone mass in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition enhances Dkk binding to 
LRP5. 

XIV. Claims 2 - 4, 7-10, 12-20, 28-32 drawn to a method of modulating bone mass in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition enhances Dkk binding to 
LRP6. 

XV. Claims 2 - 4, 7- 10, 12-20, 28-32 drawn to a method of modulating bone mass in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with HBM, wherein said composition enhances Dkk binding to 
HBM. 

XVI. Claims 2 - 4, 7-1 1, 13-23, 33-35 drawn to a method of modulating lipid levels in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition inhibits Dkk binding to 
LRP5. 

XVII. XVI. Claims 2 - 4, 7- 1 1 , 13-23, 33-35 drawn to a method of modulating lipid levels in a subject comprising adniinistering to 
the subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition inhibits Dkk binding to 
LRP6. 

XVIII. Claims 2 - 4, 7-1 1 1 13-23, 33-35 drawn to a method of modulating lipid levels in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with HBM, wherein said composition inhibits Dkk binding to 
HBM. 



XIX. Claims 2-4, 7-10, 12-23, 33-35 drawn to a method of modulating lipid levels in a subject comprising aaministering to the 
subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition enhances Dkk binding to 
LRP5. 

XX. Claims 2 - 4, 7- 10, 12-20, 33-35 drawn to a method of modulating lipid levels in a subject comprising administering to the 
subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition enhances Dkk binding to 
LRP6. 

XXI. Claims 2 - 4, 7-10, 12-20, 33-35 drawn to a method of modulating lipid levels in a subject comprising adniinistering to the 
subject a composition which modulates DKK activity or interaction with HBM, wherein said composition enhances Dkk binding to 
HBM. 



XXII. Claims 2, 35 and 36 drawn to a method of diagnosing low or high bone mass 1 
and /or high or low lipid levels in a subject comprising examining expression of Dkk. 

XXIII. Claims 2, 35 and 36 drawn to a method of diagnosing low or high bone mass 
and /or high or low lipid levels in a subject comprising examining expression of LRP5. 

XXIV. Claims 2, 35 and 36 drawn to a method of diagnosing low or high bone mass 
and /or high or low lipid levels in a subject comprising examining expression of LRP6. 

XXV. Claims 2, 35 and 36 drawn to a method of diagnosing low or high bone mass 
and /or high or low lipid levels in a subject comprising examining expression of HBM. 

XXVI. Claims 2, 37-43, 44-47, drawn to a method of screening for a compound which modulates the interaction of DKK with LRP5. 

XXVII. Claims 2, 37-43, 47, drawn to a method of screening for a compound which modulates the interaction of DKK with LRP6. 

XXVIII. Claims 2, 37-43, 47, drawn to a method of screening for a compound which modulates the interaction of DKK with HBM. 

XXIX. Claims .2,41-43, 48-49, drawn to a method of screening for a compound which modulates (he interaction of Dkk with a Dkk 
interacting proteins. 
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XXX. Claims 50-63 drawn to a composition comprising a LRP5 and a pharmaceutical acceptable carrier thereof. 

XXXI. Claims 50- 59, 63 drawn to a composition comprising a LRP6 and a pharmaceutical acceptable carrier thereof. 

XXXII. Claims 50- 59, 63 drawn to a composition comprising a HBM and a pharmaceutical acceptable carrier thereof. 

XXXIII. Claims 2 and 64 drawn to a method for identifying compound which modulate Dkk and LRP5 interaction. 

XXXIV. Claims 2 and 64 drawn to a method for identifying compound which modulate Dkk and LRP6 interaction. 

XXXV. Claims 2 and 64 drawn to a method for identifying compound which modulate Dkk and HBM interaction. 

XXXVI. Claims 2 and 65 drawn to a method of identifying binding partners for a Dkk protein. 

XXXVII. Claims 66-68 drawn to a nucleic acid and a vector encoding a Dkk interacting protein. 

XXXVIII. Claims 69-90, drawn to a method of detecting a modulatory activity of a compound, wherein the first peptide is a Dkk 
peptide and the second peptide is a LRP5. 

XXXIX. Claims 69-87, drawn to a method of detecting a modulatory activity of a compound, wherein the first peptide is a Dkk 
peptide and the second peptide is a LRP6. 

XL. Claims 69-87, drawn to a method of detecting a modulatory activity of a compound, wherein the first peptide is a Dkk peptide 
and the second peptide is a HBM. 

XLI. Claims 91-92 drawn to a transgenic animal. 

XLII. Claims 2 and 93 drawn to a method for identifying potential compound which modulate Dkk activity. 

XLIII - LXII. Claim 94, drawn to one specific peptide aptamer of one specific SEQ ID NOs : 171-88; 189-192. 

LXIII- LXXIX. Claims 95-97, drawn to an antibody which specifically recognizes and binds to specific peptides of SEQ ID NOs : 
110-127. 

LXXX . Claims 2, 98-100 , drawn to a method of identifying Dkk interacting protein which modulate the interaction of Dkk with 
Wnt signaling pathway. 

LXXXI. Claims 2, 25 and 101-104, drawn to a method for identifying Dkk interacting proteins. 
LXXXII. Claims 2, 105-106, drawn to a method for identifying compounds which modulate Dkk and LRP5 interaction. 
LXXX3IL Claims 2, 105- 106, drawn to a method for identifying compounds which modulate Dkk and LRP6 interaction. 
LXXXIV. Claims 2, 105- 106, drawn to a method for identifying compounds which modulate Dkk and HBM interaction. 

* 

LXXXV. Claims 2, 25, 107-1 10, drawn to a method for identifying compound which modulate the interaction of Dkk with Wnt 
signaling pathway. 

LXXXVI. Claims 2, 111, drawn to a method of testing compounds that modulate Dkk-mediated activity in a mammal. 

LXXXVII. Claims 2, 112, 113, drawn to method of screening for compound or composition which modulate the interaction of Dkk 
and Dkk interacting protein. 

LXXXV1II-CIX. Claim 114 drawn to antibody which recognizes and binds to one specific SEQ ID NOs: 171-192. 
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The inventions listed as Groups 1-109 do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

The invention listed as groups 1- 109 do not related to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, hey lack the same or corresponding special technical features for the following reasons: 

The special technical features of Group I is considered a method of regulating LRP5 activity in a subject, comprising administering a 
composition which inhibits Dkk bmding to LRP5. 

The special technical features of Group II is considered a method of regulating LRP5 activity in a subject, comprising administering 
a composition which inhibits Dkk binding to LRP6. 

The special technical features of Group III is considered a method of regulating LRP5 activity in a subject, comprising administering 
a composition which inhibits Dkk binding to HBM. 

The special technical features of Group IV is considered a method of regulating LRP5 activity in a subject, comprising adrninistering 
a composition which enhances Dkk binding to LRP5. 

The special technical features of Group V is considered a method of regulating LRP5 activity in a subject, comprising adrninistering a 
composition which enhances Dkk binding to LRP6. 

The special technical features of Group VI is considered a method of regulating LRP5 activity in a subject, comprising administering 
a composition which enhances Dkk binding to HBM. 
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The special technical features of Group VII is considered a method of regulating Dkk-Wnt pathway activity, comprising administering 
a composition which modulates Dkk activity, or interaction with LRP5, wherein said composition inhibits Dkk binding to LRP5. 

The special technical features of Group VIII is considered a method of regulating Dkk-Wnt pathway activity, comprising 
administering a composition which modulates Dkk activity, or interaction with LRP6, wherein said composition inhibits Dkk binding 
to LRP6. 

The special technical features of Group IX is considered a method of regulating Dkk-Wnt pathway activity, comprising administering 
a composition which modulates Dkk activity, or interaction with HBM, wherein said composition inhibits Dkk binding to HBM. 

The special technical features of Group X is considered a method of regulating Dkk-Wnt pathway activity, comprising administering a 
composition which modulates Dkk activity, or interaction with RP5, wherein said composition enhances Dkk binding to LRP5. 

The special technical features of Group XI is considered a method of regulating Dkk-Wnt pathway activity, comprising acmiinistering 
a composition which modulates Dkk activity, or interaction with LRP6, wherein said composition enhances Dkk binding to LRP6. 

The special technical features of Group XII is considered a method of regulating Dkk-Wnt pathway activity, comprising administering 
a composition which modulates Dkk activity, or interaction with HBM, wherein said composition enhances Dkk binding to HBM. 

The special technical features of Group XIII is considered a method of modulating bone mass in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition inhibits Dkk binding to 
LRP5. 

The special technical features of Group XIV is considered a method of modulating bone mass in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition inhibits Dkk binding to 
LRP6. 

The special technical features of Group XV is considered a method of modulating bone mass in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with HBM, wherein said composition inhibits Dkk binding to 
HBM. . . . 

The special technical features of Group XVI is considered a method of modulating bone mass in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition enhances Dkk binding 
to LRP5. 

The special technical features of Group XIV is considered a method of modulating bone mass in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition enhances Dkk binding 
to LRP6. 

The special technical features of Group XV is considered a method of modulating bone mass in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with HBM, wherein said composition enhances Dkk binding 
to HBM. 

The special technical features of Group XVI is considered a method of modulating lipid levels in a subject comprising administering 
to the subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition inhibits Dkk binding 
to LRP5. 

The special technical features of Group XVII is considered a method of modulating lipid levels in a subject comprising administering 
to the subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition inhibits Dkk binding 
to LRP6. 

The special technical features of Group XVIII is considered a method of modulating lipid levels in a subject comprising adniinistering 
to the subject a composition which modulates DKK activity or interaction with HBM, wherein said composition inhibits Dkk binding 
to HBM. 



The special technical features of Group XIX is considered a method of modulating lipid levels in a subject comprising administering 
to the subject a composition which modulates DKK activity or interaction with LRP5, wherein said composition enhances Dkk 
binding to LRP5. 
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The special technical features of Group XX is considered a method of modulating lipid levels in a subject comprising administering to 
the subject a composition which modulates DKK activity or interaction with LRP6, wherein said composition enhances Dkk binding 
to LRP6. 



The special technical features of Group XXI is considered a method of modulating lipid levels in a subject comprising administering 
to the subject a composition which modulates DKK activity or interaction with HBM, wherein said composition enhances Dkk 
binding to HBM. 

The special technical features of Group XXII is considered a method of diagnosing low or high bone mass and /or high or low lipid 
levels in a subject comprising exannning expression of Dkk. 

The special technical features of Group XXIII is considered a method of diagnosing low or high bone mass and /or high or low lipid 
levels in a subject comprising examining expression of LRP5. 

The special technical features of Group XXIV is considered a method of diagnosing low or high bone mass and /or high or low lipid 
levels in a subject comprising examining expression of LRP6. 

The special technical features of Group XXV is considered a method of diagnosing low or high bone mass and /or high or low lipid 
levels in a subject comprising exarnining expression of HBM. 

The special technical features of Group XXVI is considered a method of screening for a compound which modulates the interaction of 
DKK with LRP5. 

The special technical features of Group XXVII is considered a method of screening for a compound which modulates the interaction 
of DKK with LRP6. 

The special technical features of Group XXVIII is considered a method of screening for a compound which modulates the interaction 
of DKK with HBM. 

The special technical features of Group XXIX is considered a method of screening for a compound which modulates the interaction of 
Dkk with a Dkk interacting proteins . 

The special technical features of Group XXX is considered a composition comprising a LRP5 and a pharmaceutical acceptable carrier 
thereof. 



The special technical features of Group XXXI is considered a composition comprising a LRP6 and a pharmaceutical acceptable carrier 
thereof. 

The special technical features of Group XXXII is considered a composition comprising a HBM and a pharmaceutical acceptable 
carrier thereof. . 

The special technical features of Group XXXIII is considered a method for identifying compound which modulate Dkk and LRP5 
interaction. 

The special technical features of Group XXXIV is considered a method for identifying compound which modulate Dkk and LRP6 
interaction. 

The special technical features of Group XXXV is considered a method for identifying compound which modulate Dkk and HBM 
interaction. 

The special technical features of Group XXXVI is considered a method of identifying binding partners for a Dkk protein. 

The special technical features of Group XXXVII is considered a nucleic acid and a vector encoding a Dkk interacting protein. 

The special technical features of Group XXXVIII is considered a method of detecting a modulatory activity of a compound, wherein 
the first peptide is a Dkk peptide and the second peptide is a LRP5. 
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The special technical features of Group XXXIX is considered a method of detecting a modulatory activity of a compound, wherein 
the first peptide is a Dkk peptide and the second peptide is a LRP6. 

The special technical features of Group XL is considered a method of detecting a modulatory activity of a compound, wherein the 
first peptide is a Dkk peptide and the second peptide is a HBM. 

The special technical features of Group XLI is considered a transgenic animal. 

The special technical features of Group XLII is considered a method for identifying potential compound which modulate Dkk activity. 

The special technical features of Group XLIII - LXII is considered one specific peptide aptamer of one specific SEQ ID NOs : 171- 
88; 189-192. 

The special technical features of Group LXIII- LXXIX is considered an antibody which specifically recognizes and binds to specific 
peptides of SEQ ID NOs : 1 10-127. 

The special technical features of Group LXXX is considered a method of identifying Dkk interacting protein which modulate the 
interaction of Dkk with Wnt signaling pathway. 

The special technical features of Group LXXX1 is considered a method for identifying Dkk interacting proteins. 

The special technical features of Group LXXXII is considered a method for identifying compounds which modulate Dkk and LRP5 
interaction. 

The special technical features of Group LXXXIII is considered a method for identifying compounds which modulate Dkk and LRP6 
interaction. 

The special technical features of Group LXXXIV is considered a method for identifying compounds which modulate Dkk and HBM 
interaction. 

The special technical features of Group £,XXXV is considered a method for identifying compound which modulate the interaction of 
Dkk with Wnt signaling pathway. 

■ 

The special technical features of Group LXXXVI is considered a method of testing compounds that modulate Dkk-mediated activity in 
a mammal. 

The special technical features of Group LXXXVII is considered a method of screening for compound or composition which modulate 
the interaction of Dkk and Dkk interacting protein. 

The special technical features of Group LXXXVTII-CIX. is considered an antibody which recognizes and binds to one specific SEQ 
ID NOs: 171-192. 

Accordingly, Groups I-CIX are not so linked by the same or corresponding special technical feature within meaning of PCT Rule 13.2 
so as to form a single general inventive concept. 
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